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been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 
sequence PROl 192 (shown in Figure 280 and SEQ ID NO:389) has amino acid sequence identity with trout P0- 
like glycoprotein (GEN12838 IP1). Accordingly, it is presently believed that PROl 192 disclosed in the present 
application is a newly identified member of the myelin PO glycoprotein family. 

5 

123. Full-length PROl 160 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 160. In particular, cDNA encoding a PROl 160 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 
10 The DNA62872-1509 clone was isolated from a human breast tissue library using a trapping technique 

which selects for nucleotide sequences encoding secreted proteins. Thus, the DNA62872-1509 clone does 
encode a secreted factor. As far as is known, the DNA62872-1509 sequence encodes a novel factor designated 
herein as PROl 160; using the WU-BLAST2 sequence alignment computer program, no significant sequence 
identities to any known proteins were revealed. 

15 

124. Full-length PROl 187 Polypeptides 

The present invention provides newly identifledand isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 187. In particular, cDNA encoding a PROl 187 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 
20 As far as is known, the DNA62876-1517 sequence encodes a novel factor designated herein as 

PROl 187; using WU-BLAST2 sequence alignment computer programs, no significant sequence identities to any 
known proteins were revealed. 

125. Full-length PRQ1185 Polypeptides 

25 The present inventionprovides newly identified and isol ated nucleotide sequences encoding polypeptides 

referred to in the present application as PROl 185. In particular, cDNA encoding a PROl 185 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

As far as is known, the DNA62881-1515 clone encodes a novel factor designated herein as PROl 185; 
using WU-BLAST2 sequence alignment computer programs, no significant sequence identities to any known 

30 proteins were revealed. 

126. Full-length PRQ1345 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO 1345. In particular, cDNA encoding a PRO 1345 polypeptide has 
35 been identified and isolated, as disclosed in further detail in the Examples below. 

Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 
native sequence PR01345 (shown in Figure 288 and SEQ ID NO:403) has amino acid sequence identity with 
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the C-type lectin homolog precursor protein of bos taurus (BTU22298_1). Accordingly, it is presently believed 
that PR01345 disclosed in the present application is a newly identified member of the C-type lectin protein 
family and may possess activity typical of that family or of the tetranectin protein in particular. 

127. Full-length PRQ1245 Polypeptides 

5 The present invention provides newly identified and isolated nucleotide sequencesencodingpolypeptides 

referred to in the present application as PR01245. In particular, cDNA encoding a PR01245 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

The DN A64884- 1527 clone was identified using methods that selects for nucleotide sequences encoding 
secreted proteins. As far as is known, the DNA64884-1527 sequence encodes a novel secreted factor designated 
10 herein as PR01245. Using WU-BLAST2 sequence alignment computer programs, some sequence identities to 
known proteins were revealed; however, it was determined that they were not significant. 

128. Full-length PRQ1358 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
15 referred to in the present application as PRO 1358. In particular, cDNA encoding a PRO 1358 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 
sequence PR01358 (shown in Figure 292 and SEQ ID NO:4 10) has amino acid sequence identity with RASP-1 . 
Accordingly, it is presently believed that PRO 1358 disclosed in the present application is a newly identified 
20 member of the serpin family of serine protease inhibitors and may possess serine protease inhibition activity, 
protein catabolism inhibitory activity and/or be associated with regeneration of tissue. 

129. Full-length PRQ1195 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
25 referred to in the present application as PROl 195. In particular, cDNA encoding a PROl 195 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 
sequence PROl 195 (shown in Figure 294 and SEQ ID NO:412) has amino acid sequence identity with 
MMU28486 1, termed a proline rich acidic protein from Mus musculus, locus MMU28486, Accession: 
30 U28486, database GBTRANS, submitted 06-JUN-1995 by John W. Kasik. Accordingly, it is presently believed 
that PROl 195 disclosed in the present application is a newly identified member of this protein family. 

130. Full-length PRO1270 Polypeptides 

The present invention provides newly identifiedand isolated nucleotide sequences encoding polypeptides 
35 referred to in the present application as PRO1270. In particular, cDNA encoding a PRO1270 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 
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Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 
native sequence PRO1270 (shown in Figure 296 and SEQ ID NO:414) has amino acid sequence identity with 
the lectin protein (XLU86699J) of Xenopus laevis. Accordingly, it is presently believed that PRO1270 
disclosed in the present application is a newly identified member of the lectin protein family and may possess 
activity typical of that family. 

5 

131. Full-length PRQ1271 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR01271. In particular, cDNA encoding a PR01271 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 
10 As far as is known, the DNA66309-1538 sequence encodes a novel factor designated herein as 

PR01271 ; using WU-BLAST2 sequence alignment computer programs, no significant sequence identities to any 
known proteins were revealed (results further described in the examples below). 

132. Full-length PRQ1375 Polypeptides 

15 The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 

referred to in the present application as PRO 1375. In particular, cDNA encoding a PRO 1375 polypeptide has 

been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 

sequence PR01375 (shown in Figure 300 and SEQ ID NO:418) has amino acid sequence identity PUT2. 
20 Accordingly, it is presently believed that PR01375 disclosed in the present application has at least one related 

mechanism of PUT2. 

133. Full-length PRQ1385 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequencesencodingpolypeptides 
25 referred to in the present application as PR01385. In particular, cDNA encoding a PR01385 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

The DNA68869-1610 clone was isolated from a human tissue library using a trapping technique which 
selects for nucleotide sequences encoding secreted proteins. Thus, the DNA68869-1610 clone does encode a 
secreted factor. As far as is known, the DNA68869-1610 sequence encodes a novel factor designated herein as 
30 PR01385; using the WU-BLAST2 sequence alignment computer program, no significant sequence identities to 
any known proteins were revealed. 

134. Full-length PRQ1387 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
35 referred to in the present application as PR01387. In particular, cDNA encoding a PR01387 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 
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Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 
native sequence PR01387 (shown in Figure 304 and SEQ ID NO:422) has amino acid sequence identity with 
the myelin pO protein protein precursor (MYP0_HETFR). Accordingly, it is presently believed that PRO 1387 
disclosed in the present application is a newly identified member of the myelin protein family and may possess 
activity typical of that family. 

5 

135. Full-length PRQ1384 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR01384. In particular, cDNA encoding a PR01384 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 
10 Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 

sequence PR01384 (shown in Figure 306 and SEQ ID NO:424) has amino acid sequence identity with NKG2-D 
(AF054819_1; Dayhoff database, version 35.45 SwissProt 35). Accordingly, it is presently believed that 
PR01384 disclosed in the present application is a newly identified member of the NKG2 family and may possess 
MHC activation/inactivation activities typical of the NKG2 family. 

15 

B. PRO Variants 

In addition to the full-length native sequence PRO polypeptides described herein, it is contemplated that 
PRO variants can be prepared. PRO variants can be prepared by introducing appropriate nucleotide changes into 
the PRO DNA, and/or by synthesis of the desired PRO polypeptide. Those skilled in the art will appreciate that 

20 amino acid changes may alter post-translational processes of the PRO, such as changing the number or position 
of glycosylation sites or altering the membrane anchoring characteristics. 

Variations in the native full-length sequence PRO or in various domains of the PRO described herein, 
can be made, for example, using any of the techniques and guidelines for conservative and non-conservative 
mutations set forth, for instance, in U.S. Patent No. 5,364,934. Variations may be a substitution, deletion or 

25 insertion of one or more codons encoding the PRO that results in a change in the amino acid sequence of the 
PRO as compared with the native sequence PRO. Optionally the variation is by substitution of at least one amino 
acid with any other amino acid in one or more of the domains of the PRO. Guidance in determining which 
amino acid residue may be inserted, substituted or deleted without adversely affecting the desired activity may 
be found by comparing the sequence of the PRO with mat of homologous known protein molecules and 

30 minimizing the number of amino acid sequence changes made in regions of high homology. Amino acid 
substitutions can be the result of replacing one amino acid with another amino acid having similar structural 
and/or chemical properties, such as the replacement of a leucine with a serine, i.e., conservative amino acid 
replacements. Insertions or deletions may optionally be in the range of about 1 to 5 amino acids. The variation 
allowed may be determined by systematically making insertions, deletions or substitutions of amino acids in the 

35 sequence and testing the resulting variants for activity exhibited by the full-length or mature native sequence. 

PRO polypeptide fragments are provided herein. Such fragments may be truncated at the N-terminus 
or C-terminus, or may lack internal residues, for example, when compared with a full length native protein. 



346 



WO 99/63088 



PCT/US99/12252 



Certain fragments lack amino acid residues that are not essential for a desired biological activity of the PRO 
polypeptide. 

PRO fragments may be prepared by any of a number of conventional techniques. Desired peptide 
fragments may be chemically synthesized. An alternative approach involves generating PRO fragments by 
enzymatic digestion, e.g. , by treating the protein with an enzyme known to cleave proteins at sites defined by 
particular amino acid residues, or by digesting the DNA with suitable restriction enzymes and isolating the 
desired fragment. Yet another suitable technique involves isolating and amplifying a DNA fragment encoding 
a desired polypeptide fragment, by polymerase chain reaction (PCR). Oligonucleotides that define the desired 
termini of the DNA fragment are employed at the 5 ? and 3' primers in the PCR. Preferably, PRO polypeptide 
fragments share at least one biological and/or immunological activity with the native PRO polypeptide disclosed 
herein. 

In particular embodiments, conservative substitutions of interest are shown in Table 1 under the heading 
of preferred substitutions. If such substitutions result in a change in biological activity, then more substantial 
changes, denominated exemplary substitutions in Table 1, or as further described below in reference to amino 
acid classes, are introduced and the products screened. 
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Table 1 



Original 


Exemplary 


Preferred 


Residue 


Substitutions 


Substitutions 


Ala (A) 


val; leu; ile 


val 


Arg (R) 


lys; gin; asn 


lys 


Asn (N) 


cln: his: lvs: arc 


cln 


Asp (D) 


alu 


filu 


Cys (C) 


ser 


ser 


Gin (Q) 


asn 


asn 


Glu (E) 


asp 


asp 


Glv (G) 


pro; ala 


ala 


His (H) 


asn* sin* lvs* arc 


are 


He (D 


leu; val; met; ala; phe; 






norleucine 


leu 


Leu (L) 


norleucine; ile; val; 






met; ala; phe 


ile 


Lvs (K) 


arg; gin; asn 


are 


Met (M) 


leu; phe; ile 


leu 


Phe (F) 


leu; val; ile; ala; tyr 


leu 


Pro (P) 


ala 


ala 


Ser (S) 


thr 


thr 


Thr (T) 


ser 


ser 


Trp(W) 


tyr; phe 


tyr 


Tyr(Y) 


trp; phe; thr; ser 


phe 


Val (V) 


ile; leu; met; phe; 






ala; norleucine 


leu 



10 



15 



20 



25 



Substantial modifications in function or immunological identity of the PRO polypeptide are accomplished 
30 by selecting substitutions that differ significantly in their effect on maintaining (a) the structure of the polypeptide 

backbone in the area of the substitution, for example, as a sheet or helical conformation, (b) the charge or 

hydrophobicity of the molecule at the target site, or (c) the bulk of the side chain. Naturally occurring residues 

are divided into groups based on common side-chain properties: 

(1) hydrophobic: norleucine, met, ala, val, leu, ile; 
35 (2) neutral hydrophilic: cys, ser, thr; 

(3) acidic: asp, glu; 

(4) basic: asn, gin, his, lys, arg; 

(5) residues that influence chain orientation: gly, pro; and 

(6) aromatic: trp, tyr, phe. 

40 Non-conservative substitutions will entail exchanging a member of one of these classes for another class. 

Such substituted residues also may be introduced into the conservative substitution sites or, more preferably, into 
the remaining (non-conserved) sites. 

The variations can be made using methods known in the art such as oligonucleotide-mediated (site- 
directed) mutagenesis, alanine scanning, and PCR mutagenesis. Site-directed mutagenesis [Carter et al. , Nucl. 

45 Acids Res.. 13:4331 (1986); Zoller et al., Nucl. Acids Res.. 10:6487 (1987)], cassette mutagenesis [Wells et 
al., Gene . 34*315 (1985)], restriction selection mutagenesis [Wells et al., Philos. Trans. R. Soc. London SerA. 
317:415 (1986)] or other known techniques can be performed on the cloned DNA to produce the PRO variant 
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DNA. 

Scanning amino acid analysis can also be employed to identify one or more amino acids along a 
contiguous sequence. Among the preferred scanning amino acids are relatively small, neutral amino acids. Such 
amino acids include alanine, glycine, serine, and cysteine. Alanine is typically a preferred scanning amino acid 
among this group because it eliminates the side-chain beyond the beta-carbon and is less likely to alter the main- 
5 chain conformation of the variant [Cunningham and Wells, Science. 244: 1081-1085 (1989)]. Alanine is also 
typically preferred because it is the most common amino acid. Further, it is frequently found in both buried and 
exposed positions [Creighton, The Proteins . (W.H. Freeman & Co., N.Y.); Chothia, J. Mol. Biol.. 150:1 
(1976)]. If alanine substitution does not yield adequate amounts of variant, an isoteric amino acid can be used. 

10 C. Modifications of PRO 

Covalent modifications of PRO are included within the scope of this invention. One type of covalent 
modification includes reacting targeted amino acid residues of a PRO polypeptide with an organic derivatizing 
agent that is capable of reacting with selected side chains or the N- or C- terminal residues of the PRO. 
Derivatization with bi functional agents is useful, for instance, for crosslinking PRO to a water-insoluble support 

15 matrix or surface for use in the method for purifying anti-PRO antibodies, and vice-versa. Commonly used 
crosslinking agents include, e.g., l,l-bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide 
esters, for example, esters with 4-azidosalicylic acid, homobifunctional imidoesters, including disuccinimidyl 
esters such as 3,3'-dithiobis(succinimidylpropionate), Afunctional maleimides such as bis-N-maleimido-1,8- 
octane and agents such as methyl-3-[(p-azidophenyl)dithio]propioimidate. 

20 Other modifications include deamidation of glutaminyl and asparaginyl residues to the corresponding 

glutamyl and aspartyl residues, respectively, hydroxylation of proline and lysine, phosphorylation of hydroxyl 
groups of seryl or threonyl residues, methylation of the cc-amino groups of lysine, arginine, and histidine side 
chains [T.E. Creighton, Proteins: Structure and Molecular Properties. W.H. Freeman & Co., San Francisco, 
pp. 79-86 (1983)], acetylation of the N-terminal amine, and amidation of any C-terminal carboxyl group. 

25 Another type of covalent modification of the PRO polypeptide included within the scope of this 

invention comprises altering the native glycosylation pattern of the polypeptide. "Altering the native 
glycosylation pattern" is intended for purposes herein to mean deleting one or more carbohydrate moieties found 
in native sequence PRO (either by removing the underlying glycosylation site or by deleting the glycosylation 
by chemical and/or enzymatic means), and/or adding one or more glycosylation sites that are not present in the 

30 native sequence PRO. In addition, the phrase includes qualitative changes in the glycosylation of the native 
proteins, involving a change in the nature and proportions of the various carbohydrate moieties present. 

Addition of glycosylation sites to the PRO polypeptide may be accomplished by altering the amino acid 
sequence. The alteration may be made, for example, by the addition of, or substitution by, one or more serine 
or threonine residues to the native sequence PRO (for O-linked glycosylation sites). The PRO amino acid 

35 sequence may optionally be altered through changes at the DNA level, particularly by mutating the DNA 
encoding the PRO polypeptide at preselected bases such that codons are generated that will translate into the 
desired amino acids. 
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Another means of increasing the number of carbohydrate moieties on the PRO polypeptide is by 
chemical or enzymatic coupling of glycosides to the polypeptide. Such methods are described in the art, e.g., 
in WO 87/05330 published 1 1 September 1987, and in Aplin and Wriston, CRC Grit. Rev. Biochem,, pp. 259- 
306 (1981). 

Removal of carbohydrate moieties present on the PRO polypeptide may be accomplished chemically 
5 or enzymatically or by mutational substitution of codons encoding for amino acid residues that serve as targets 
for glycosylation. Chemical deglycosylation techniques are known in the art and described, for instance, by 
Hakimuddin, et ah, Arch. Biochem. Biophvs.. 25£:52 (1987) and by Edge et al., Anal. Biochem., 118:131 
(1981). Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the use of a variety 
of endo- and exo-glycosidases as described by Thotakura et al., Meth. Enzvmol., 138i350 (1987). 
10 Another type of covalent modification of PRO comprises linking the PRO polypeptide to one of a variety 

of nonproteinaceous polymers, e.g., polyethylene glycol (PEG), polypropylene glycol, or polyoxyalkylenes, in 
the manner set forth in U.S. Patent Nos. 4,640,835; 4,496,689; 4,301 , 144; 4,670,4 17; 4,791 , 1 92 or 4, 179,337. 

The PRO of the present invention may also be modified in a way to form a chimeric molecule 
comprising PRO fused to another, heterologous polypeptide or amino acid sequence. 
15 In one embodiment, such a chimeric molecule comprises a fusion of the PRO with a tag polypeptide 

which provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is generally placed 
at the amino- or carboxyl- terminus of the PRO. The presence of such epitope-tagged forms of the PRO can be 
detected using an antibody against the tag polypeptide. Also, provision of the epitope tag enables the PRO to 
be readily purified by affinity purification using an anti-tag antibody or another type of affinity matrix that binds 
20 to the epitope tag. Various tag polypeptides and their respective antibodies are well known in the art. Examples 
include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; the flu HA tag polypeptide and its 
antibody 12CA5 [Field et al., Mol. Cell. Biol.. 8:2159-2165 (1988)]; the c-myc tag and the 8F9, 3C7, 6E10, 
G4, B7 and 9E10 antibodies thereto [Evan et al., Molecular and Cellular Biology, 5:3610-3616 (1985)]; and the 
Herpes Simplex virus glycoprotein D (gD) tag and its antibody [Paborsky et al. , Protein Engineering 3(6):547- 
25 553 (1990)]. Other tag polypeptides include the Flag-peptide [Hopp et al., BioTechnology, 6:1204-1210 
(1988)]; the KT3 epitope peptide [Martin et al., Science . 255:192-194 (1992)]; an a-tubulin epitope peptide 
[Skinner et al., J. Biol. Chem. . 266:15163-15166 (1991)]; and the T7 gene 10 protein peptide tag [Lutz- 
Freyermuth et al., Proc. Natl. Acad. Sci. USA. 87:6393-6397 (1990)]. 

In an alternative embodiment, the chimeric molecule may comprise a fusion of the PRO with an 
30 immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the chimeric molecule (also 
referred to as an "inimunoadhesnO, such a fusion could be to the Fc region of an IgG molecule. The Ig fusions 
preferably include the substitution of a soluble (transmembrane domain deleted or inactivated) form of a PRO 
polypeptide in place of at least one variable region within an Ig molecule. In a particularly preferred 
embodiment, the immunoglobulin fusion includes the hinge, CH2 and CH3, or the hinge, CHI, CH2 and CH3 
35 regions of an IgGl molecule. For the production of immunoglobulin fusions see also US Patent No. 5,428, 130 
issued June 27, 1995. 
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D. Preparation of PRO 

The description below relates primarily to production of PRO by culturing cells transformed or 
transfected with a vector containing PRO nucleic acid. It is, of course, contemplated that alternative methods, 
which are well known in the art, may be employed to prepare PRO. For instance, the PRO sequence, or 
portions thereof, may be produced by direct peptide synthesis using solid-phase techniques [see, e.g., Stewart 
5 et al. f Solid-Phase Peptide Synthesis. W.H. Freeman Co., San Francisco, CA (1969); Merrifield, J. Am. Chem. 
Soc.. 85:2149-2154 (1963)]. In vitro protein synthesis may be performed using manual techniques or by 
automation. Automated synthesis may be accomplished, for instance, using an Applied Biosy stems Peptide 
Synthesizer (Foster City, CA) using manufacturer's instructions. Various portions of the PRO may be 
chemically synthesized separately and combined using chemical or enzymatic methods to produce the full-length 
10 PRO. 

1. Isolation of DNA Encoding PRO 
DNA encoding PRO may be obtained from a cDNA library prepared from tissue believed to possess 
the PRO mRNA and to express it at a detectable level. Accordingly, human PRO DNA can be conveniently 
15 obtained from a cDNA library prepared from human tissue, such as described in the Examples. The PRO- 
encoding gene may also be obtained from a genomic library or by known synthetic procedures (e.g., automated 
nucleic acid synthesis). 

Libraries can be screened with probes (such as antibodies to the PRO or oligonucleotides of at least 
about 20-80 bases) designed to identify the gene of interest or the protein encoded by it. Screening the cDNA 

20 or genomic library with the selected probe may be conducted using standard procedures, such as described in 
Sambrook et al., Molecular Cloning: A Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 
1989). An alternative means to isolate the gene encoding PRO is to use PCR methodology [Sambrook et al., 
supra : Dieffenbach et al., PCR Primer: A Laboratory Manual (Cold Spring Harbor Laboratory Press, 1995)]. 
The Examples below describe techniques for screening a cDN A library. The oligonucleotide sequences 

25 selected as probes should be of sufficient length and sufficiently unambiguous that false positives are minimized. 
The oligonucleotide is preferably labeled such that it can be detected upon hybridization to DNA in the library 
being screened. Methods of labeling are well known in die art, and include die use of radiolabels like 32 P-labeled 
ATP, biotinylation or enzyme labeling. Hybridization conditions, including moderate stringency and high 
stringency, are provided in Sambrook et al., supra. 

30 Sequences identified in such library screening methods can be compared and aligned to other known 

sequences deposited and available in public databases such as GenBank or other private sequence databases. 
Sequence identity (at either the amino acid or nucleotide level) within defined regions of the molecule or across 
the full-length sequence can be determined using methods known in the an and as described herein. 

Nucleic acid having protein coding sequence may be obtained by screening selected cDN A or genomic 

35 libraries using the deduced amino acid sequence disclosed herein for the first time, and, if necessary, using 
conventional primer extension procedures as described in Sambrook et al., supra , to detect precursors and 
processing intermediates of mRNA that may not have been reverse-transcribed into cDNA. 
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2. Selection and Transformation of Host Cells 
Host cells are transfected or transformed with expression or cloning vectors described herein for PRO 
production and cultured in conventional nutrient media modified as appropriate for inducing promoters, selecting 
transformants, or amplifying the genes encoding the desired sequences. The culture conditions, such as media, 
temperature, pH and the like, can be selected by the skilled artisan without undue experimentation. In general, 
5 principles, protocols, and practical techniques for maximizing the productivity of cell cultures can be found in 
Mammalian Cell Biotechnology: a Practical Approach. M. Butler, ed. (IRL Press, 1991) and Sambrook et al., 
supra . 

Methods of eukaryotic ceil transfection and prokaryotic cell transformation are known to the ordinarily 
skilled artisan, for example, CaCl 2 , CaP0 4 , liposome-mediated and electroporation. Depending on the host cell 

10 used, transformation is performed using standard techniques appropriate to such cells. The calcium treatment 
employing calcium chloride, as described in Sambrook et al., supra , or electroporation is generally used for 
prokaryotes. Infection with Agrobacterium tumefaciens is used for transformation of certain plant cells, as 
described by Shaw et al. , Gene . 23 :3 15 (1983) and WO 89/05859 published 29 June 1989. For mammalian cells 
without such cell walls, the calcium phosphate precipitation method of Graham and van der Eb, Virology. 

15 52:456-457 (1978) can be employed. General aspects of mammalian cell host system transfections have been 
described in U.S. Patent No. 4,399,216. Transformations into yeast are typically carried out according to the 
method of Van Solingenetal., J. Bact. . 130:946 (1977) and Hsiao etal., Proc. Natl. Acad. Sci. (USA). 76:3829 
(1979). However, other methods for introducing DNA into cells, such as by nuclear microinjection, 
electroporation, bacterial protoplast fusion with intact cells, or polycations, e.g., polybrene, polyornithine, may 

20 also be used. For various techniques for transforming mammalian cells, see Keown et al., Methods in 
Enzvmology. 185:527-537 (1990) and Mansour et al., Nature. 336:348-352 (1988). 

Suitable host cells for cloning or expressing the DNA in the vectors herein include prokaryote, yeast, 
or higher eukaryote cells. Suitable prokaryotes include but are not limited to eubacteria, such as Gram-negative 
or Gram-positive organisms, for example, Enterobacteriaceae such as E. coli. Various £. coli strains are 

25 publicly available, such as E. coli K12 strain MM294 (ATCC 31,446); E. coli X1776 (ATCC 31,537); E. coli 
strain W3110 (ATCC 27,325) and K5 772 (ATCC 53,635). Other suitable prokaryotic host cells include 
Enterobacteriaceae such as Escherichia, e.g., E. coli, Enterobacter, Erwinia, Klebsiella, Proteus, Salmonella, 
e.g., Salmonella typhimurium, Serratia, e.g., Serratia marcescans, and Shigella, as well as Bacilli such as B. 
subtilis and B. licheniformis (e.g., B. licheniformis 41P disclosed in DD 266,710 published 12 April 1989), 

30 Pseudomonas such as P. aeruginosa, and Streptomyces . These examples are illustrative rather than limiting. 
Strain W3 1 10 is one particularly preferred host or parent host because it is a common host strain for recombinant 
DNA product fermentations. Preferably, the host cell secretes minimal amounts of proteolytic enzymes. For 
example, strain W31 10 may be modified to effect a genetic mutation in the genes encoding proteins endogenous 
to the host, with examples of such hosts including E. coli W31 10 strain 1 A2, which has the complete genotype 

35 tonA ; E. coli W3110 strain 9E4, which has the complete genotype tonA ptr3; £. coli W3110 strain 27C7 
(ATCC 55,244), which has the complete genotype tonA ptr3 phoA El 5 (argF-lac)169 degP ompTkarf', E. coli 
W3 1 10 strain 37D6, which has the complete genotype tonA ptr3 phoA El 5 (argF-lac)169 degP ompT rbs7 
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ilvG karf \ E. coli W3110 strain 40B4, which is strain 37D6 with a non-kanamycin resistant degP deletion 
mutation; and an E. coli strain having mutant periplasmic protease disclosed in U.S. Patent No. 4,946,783 issued 
7 August 1990. Alternatively, in vitro methods of cloning, e.g. , PCR or other nucleic acid polymerase reactions, 
are suitable. 

In addition to prokaryotes, eukaryotic microbes such as filamentous fungi or yeast are suitable cloning 
5 or expression hosts for PRO-encoding vectors. Saccharomyces cerevisiae is a commonly used lower eukaryotic 
host microorganism. Others include Schizosaccharomyces pombe (Beach and Nurse, Nature . 290: 140 [1981]; 
EP 139,383 published 2 May 1985); Kluyveromyces hosts (U.S. Patent No. 4,943,529; Fleer et al., 
Bio/Technology. 9:968-975 (1991)) such as, e.g., K. lactis (MW98-8C, CBS683, CBS4574; Louvencourt et al., 
J. Bacteriol. . 737 [1983]), K. fragilis (ATCC 12,424), K. bulgaricus (ATCC 16,045), K. mckeramii (ATCC 

10 24,178), K. waltii (ATCC 56,500), K. drosophilarum (ATCC 36,906; Van den Berg et al., Bio/Technology. 
8:135 (1990)), K. thermotolerans, and K. marxianus; yarrowia (EP 402,226); Pichia pastoris (EP 183,070; 
Sreekrishna et al., J. Basic Microbiol.. 28:265-278 [1988]); Candida; Trichoderma reesia (EP 244,234); 
Neurospora crassa (Case et al., Proc. Natl. Acad. Sci. USA . 76:5259-5263 [1979]); Schwanniomyces such as 
Schwanniomyces occidentalis (EP 394,538 published 31 October 1990); and filamentous fungi such as, e.g., 

15 Neurospora, Penicillium, Tolypocladium (WO 91/00357 published 10 January 1991), and Aspergillus hosts such 
as A, nidulans (Ballance et al., Biochem. Biophvs. Res. Commun.. 112:284-289 [1983]; Tilburn et al., Gene . 
26:205-221 [1983]; Yeltonet al., Proc. Natl. Acad. Sci. USA . 81: 1470-1474 [1984]) and A. niger (Kelly and 
Hynes, EMBO J., 4:475-479 [1985]). Methylotropic yeasts are suitable herein and include, but are not limited 
to, yeast capable of growth on methanol selected from the genera consisting of Hansenula, Candida, Kloeckera, 

20 Pichia, Saccharomyces, Torulopsis, and Rhodotorula. A list of specific species that are exemplary of this class 
of yeasts may be found in C. Anthony, The Biochemistry of Methvlotrophs, 269 (1982). 

Suitable host cells for the expression of glycosylated PRO are derived from multicellular organisms. 
Examples of invertebrate cells include insect cells such as Drosophila S2 and Spodoptera Sf9, as well as plant 
cells. Examples of useful mammalian host cell lines include Chinese hamster ovary (CHO) and COS cells. 

25 More specific examples include monkey kidney CV1 line transformed by SV40 (COS-7, ATCC CRL 1651); 
human embryonic kidney line (293 or 293 cells subcloned for growth in suspension culture, Graham et al., J._ 
Gen Virol. . 36:59 (1977)); Chinese hamster ovary cells/-DHFR (CHO, Urlaub and Chasin, Proc. Natl. Acad. 
Sci. USA . 77:4216 (1980)); mouse Sertoli cells (TM4, Mather, Biol. Reprod.. 23:243-251 (1980)); human lung 
cells (W138, ATCC CCL 75); human liver cells (Hep G2, HB 8065); and mouse mammary tumor (MMT 

30 060562, ATCC CCL51). The selection of the appropriate host cell is deemed to be within the skill in the art. 

3. Selection and Use of a Replicable Vector 
The nucleic acid (e.g. , cDNA or genomic DNA) encoding PRO may be inserted into a replicable vector 
for cloning (amplification of the DNA) or for expression. Various vectors are publicly available. The vector 
35 may, for example, be in the form of a plasmid, cosmid, viral particle, or phage. The appropriate nucleic acid 
sequence may be inserted into the vector by a variety of procedures. In general, DNA is inserted into an 
appropriate restriction endonuclease site(s) using techniques known in the an. Vector components generally 
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include, but are not limited to, one or more of a signal sequence, an origin of replication, one or more marker 
genes, an enhancer element, a promoter, and a transcription termination sequence. Construction of suitable 
vectors containing one or more of these components employs standard ligation techniques which are known to 
the skilled artisan. 

The PRO may be produced recombinantly not only directly, but also as a fusion polypeptide with a 
5 heterologous polypeptide, which may be a signal sequence or other polypeptide having a specific cleavage site 
at the N-terminus of the mature protein or polypeptide. In general, the signal sequence may be a component of 
the vector, or it may be a part of the PRO-encoding DNA that is inserted into the vector. The signal sequence 
may be a prokaryotic signal sequence selected, for example, from the group of the alkaline phosphatase, 
penicillinase, lpp, or heat-stable enterotoxin II leaders. For yeast secretion the signal sequence may be, e.g., 

10 the yeast invertase leader, alpha factor leader (including Saccharomyces and Kluyveromyces cc-factor leaders, 
the latter described in U.S. Patent No. 5,010,182), or acid phosphatase leader, the C. albicans glucoamylase 
leader (EP 362,179 published 4 April 1990), or the signal described in WO 90/13646 published 15 November 
1990. In mammalian cell expression, mammalian signal sequences may be used to direct secretion of the 
protein, such as signal sequences from secreted polypeptides of the same or related species, as well as viral 

15 secretory leaders. 

Both expression and cloning vectors contain a nucleic acid sequence that enables the vector to replicate 
in one or more selected host cells. Such sequences are well known for a variety of bacteria, yeast, and viruses. 
The origin of replication from the plasmid pBR322 is suitable for most Gram-negative bacteria, the 2/i plasmid 
origin is suitable for yeast, and various viral origins (SV40, polyoma, adenovirus, VSV or BPV) are useful for 

20 cloning vectors in mammalian cells. 

Expression and cloning vectors will typically contain a selection gene, also termed a selectable marker. 
Typical selection genes encode proteins that (a) confer resistance to antibiotics or other toxins, e.g., ampicillin, 
neomycin, methotrexate, or tetracycline, (b) complement auxotrophic deficiencies, or (c) supply critical nutrients 
not available from complex media, e.g., the gene encoding D-alanine racemase for Bacilli. 

25 An example of suitable selectable markers for mammalian cells are those that enable die identification 

of cells competent to take up the PRO-encoding nucleic acid, such as DHFR or thymidine kinase. An 
appropriate host cell when wild- type DHFR is employed is the CHO cell line deficient in DHFR activity, 
prepared and propagated as described by Urlaub et al. , Proc. Nad. Acad. Sci. USA . 77:4216 (1980). A suitable 
selection gene for use in yeast is the trp\ gene present in the yeast plasmid YRp7 [Stinchcomb et al., Nature . 

30 282:39 (1979); Kingsman et al., Gene . 7:141 (1979); Tschemper et al., Gene . 10:157 (1980)]. The trp\ gene 
provides a selection marker for a mutant strain of yeast lacking the ability to grow in tryptophan, for example, 
ATCC No. 44076 or PEP4-1 [Jones, Genetics. 85:12 (1977)]. 

Expression and cloning vectors usually contain a promoter operably linked to the PRO-encoding nucleic 
acid sequence to direct mRNA synthesis. Promoters recognized by a variety of potential host cells are well 

35 known. Promoters suitable for use with prokaryotic hosts include the p-lactamase and lactose promoter systems 
[Chang et al., Nature . 275:615 (1978); Goeddel et al., Nature . 281:544 (1979)], alkaline phosphatase, a 
tryptophan (tip) promoter system [Goeddel, Nucleic Acids Res.. 8:4057 (1980); EP 36,776], and hybrid 
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promoters such as the tac promoter [deBoer et al., Proc. Natl. Acad. Sci. USA. 80:21-25 (1983)]. Promoters 
for use in bacterial systems also will contain a Shine-Dalgarno (S.D.) sequence operably linked to the DNA 
encoding PRO. 

Examples of suitable promoting sequences for use with yeast hosts include the promoters for 3- 
phosphoglycerate kinase [Hitzeman et al., J. Biol. Chem.. 255:2073 (1980)] or other glycolytic enzymes [Hess 
5 et al., J. Adv. Enzvme Reg.. 7:149 (1968); Holland, Biochemistry. 17:4900 (1978)], such as enolase, 
glyceraldehyde-3-phosphate dehydrogenase, hexokinase,pyruvatedecarboxylase,phosphofructokinase, glucose- 
6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphateisomerase,phosphoglucose 
isomerase, and glucokinase. 

Other yeast promoters, which are inducible promoters having the additional advantage of transcription 

10 controlled by growth conditions, are the promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid 
phosphatase, degradative enzymes associated with nitrogen metabolism, metallothionein, glyceraldehyde-3- 
phosphate dehydrogenase, and enzymes responsible for maltose and galactose utilization. Suitable vectors and 
promoters for use in yeast expression are further described in EP 73,657. 

PRO transcription from vectors in mammalian host cells is controlled, for example, by promoters 

15 obtained from the genomes of viruses such as polyoma virus, fowlpox virus (UK 2,21 1,504 published 5 July 
1989), adenovirus (such as Adenovirus 2), bovine papilloma virus, avian sarcoma virus, cytomegalovirus, a 
retrovirus, hepatitis-B virus and Simian Virus 40 (SV40), from heterologous mammalian promoters, e.g., the 
actin promoter or an immunoglobulin promoter, and from heat- shock promoters, provided such promoters are 
compatible with the host cell systems. 

20 Transcription of a DNA encoding the PRO by higher eukaryotes may be increased by inserting an 

enhancer sequence into the vector. Enhancers are cis-acting elements of DNA, usually about from 10 to 300 
bp, that act on a promoter to increase its transcription. Many enhancer sequences are now known from 
mammalian genes (globin, elastase, albumin, a-fetoprotein, and insulin). Typically, however, one will use an 
enhancer from a eukaryotic cell virus. Examples include the SV40 enhancer on the late side of the replication 

25 origin (bp 100-270), the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the 
replication origin, and adenovirus enhancers. The enhancer may be spliced into the vector at a position 5* or 
3' to the PRO coding sequence, but is preferably located at a site 5' from the promoter. 

Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human, or nucleated 
cells from other multicellular organisms) will also contain sequences necessary for the termination of 

30 transcription and for stabilizing the mRNA. Such sequences are commonly available from the 5* and, 
occasionally 3', untranslated regions of eukaryotic or viral DNAs or cDNAs. These regions contain nucleotide 
segments transcribed as polyadenylated fragments in the untranslated portion of the mRNA encoding PRO. 

Still other methods, vectors, and host cells suitable for adaptation to the synthesis of PRO in 
recombinant vertebrate cell culture are described in Gething et al., Nature . 293:620-625 (1981); Mantei et al., 

35 Nature . 281:40-46 (1979); EP 117,060; and EP 117,058. 
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4. Detecting Gene Amplification/Expression 
Gene amplification and/or expression may be measured in a sample directly, for example, by 

conventional Southern blotting, Northern blotting to quantitate the transcription of mRNA [Thomas, Proc. Natl. 
Acad. Sci. USA . 77:5201-5205 (1980)], dot blotting (DNA analysis), or in situ hybridization, using an 
appropriately labeled probe, based on the sequences provided herein. Alternatively, antibodies may be employed 
that can recognize specific duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA hybrid duplexes 
or DNA-protein duplexes. The antibodies in turn may be labeled and the assay may be carried out where the 
duplex is bound to a surface, so that upon the formation of duplex on the surface, the presence of antibody bound 
to the duplex can be detected. 

Gene expression, alternatively, may be measured by immunological methods, such as 
immunohistochemical staining of cells or tissue sections and assay of cell culture or body fluids, to quantitate 
directly the expression of gene product. Antibodies useful for immunohistochemical staining and/or assay of 
sample fluids may be either monoclonal or polyclonal, and may be prepared in any mammal. Conveniently, the 
antibodies may be prepared against a native sequence PRO polypeptide or against a synthetic peptide based on 
the DNA sequences provided herein or against exogenous sequence fused to PRO DNA and encoding a specific 
antibody epitope. 

5. Purification of Polypeptide 
Forms of PRO may be recovered from culture medium or from host cell lysates. If membrane-bound, 

it can be released from the membrane using a suitable detergent solution (e.g. Triton-X 100) or by enzymatic 
20 cleavage. Cells employed in expression of PRO can be disrupted by various physical or chemical means, such 
as freeze-thaw cycling, sonication, mechanical disruption, or cell lysing agents. 

It may be desired to purify PRO from recombinant cell proteins or polypeptides. The following 
procedures are exemplary of suitable purification procedures: by fractionation on an ion-exchange column; 
ethanol precipitation; reverse phase HPLC; chromatography on silica or on a cation-exchange resin such as 
25 DEAE; chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; gel filtration using, for example, 
Sephadex G-75; protein A Sepharose columns to remove contaminants such as IgG; and metal chelating columns 
to bind epitope- tagged forms of the PRO. Various methods of protein purification may be employed and such 
methods are known in the an and described for example in Deutscher, Methods in Enzvmology. 182 (1990); 
Scopes, Protein Purification: Principles and Practice. Springer- Verlag, New York (1982). The purification 
30 step(s) selected will depend, for example, on the nature of the production process used and the particular PRO 
produced. 

E. Uses for PRO 

Nucleotide sequences (or their complement) encoding PRO have various applications in the art of 
35 molecular biology, including uses as hybridization probes, in chromosome and gene mapping and in the 
generation of anti-sense RNA and DNA. PRO nucleic acid will also be useful for the preparation of PRO 
polypeptides by the recombinant techniques described herein. 
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The full-length native sequence PRO gene, or portions thereof, may be used as hybridization probes 
for a cDNA library to isolate the full-length PRO cDNA or to isolate still other cDNAs (for instance, those 
encoding naturally occurring variants of PRO or PRO from other species) which have a desired sequence identity 
to the native PRO sequence disclosed herein. Optionally, the length of the probes will be about 20 to about 50 
bases. The hybridization probes may be derived from at least partially novel regions of the full length native 
5 nucleotide sequence wherein those regions may be determined without undue experimentation or from genomic 
sequences including promoters, enhancer elements and introns of native sequence PRO. By way of example, 
a screening method will comprise isolating the coding region of the PRO gene using the known DNA sequence 
to synthesize a selected probe of about 40 bases. Hybridization probes may be labeled by a variety of labels, 
including radionucleotides such as M P or 35 S, or enzymatic labels such as alkaline phosphatase coupled to the 

10 probe via avidin/biotin coupling systems. Labeled probes having a sequence complementary to that of the PRO 
gene of the present invention can be used to screen libraries of human cDNA, genomic DNA or mRNA to 
determine which members of such libraries the probe hybridizes to. Hybridization techniques are described in 
further detail in the Examples below. 

Any EST sequences disclosed in the present application may similarly be employed as probes, using 

IS the methods disclosed herein. 

Other useful fragments of the PRO nucleic acids include antisense or sense oligonucleotides comprising 
a singe-stranded nucleic acid sequence (either RNA or DNA) capable of binding to target PRO mRNA (sense) 
or PRO DNA (antisense) sequences. Antisense or sense oligonucleotides, according to the present invention, 
comprise a fragment of the coding region of PRO DNA. Such a fragment generally comprises at least about 14 

20 nucleotides, preferably from about 14 to 30 nucleotides. The ability to derive an antisense or a sense 
oligonucleotide, based upon a cDNA sequence encoding a given protein is described in, for example, Stein and 
Cohen (Cancer Res. 48:2659. 1988) and van der Krol et al. ( BioTechniques 6 :958. 1988). 

Binding of antisense or sense oligonucleotides to target nucleic acid sequences results in the formation 
of duplexes that block transcription or translation of the target sequence by one of several means, including 

25 enhanced degradation of the duplexes, premature termination of transcription or translation, or by other means. 
The antisense oligonucleotides thus may be used to block expression of PRO proteins. Antisense or sense 
oligonucleotides further comprise oligonucleotides having modified sugar-phosphodiester backbones (or other 
sugar linkages, such as those described in WO 91/06629) and wherein such sugar linkages are resistant to 
endogenous nucleases. Such oligonucleotides with resistant sugar linkages are stable in vivo (i.e., capable of 

30 resisting enzymatic degradation) but retain sequence specificity to be able to bind to target nucleotide sequences. 

Other examples of sense or antisense oligonucleotides include those oligonucleotides which are 
covalently linked to organic moieties, such as those described in WO 90/10048, and other moieties that increases 
affinity of the oligonucleotide for a target nucleic acid sequence, such as poly-(L-lysine). Further still, 
intercalating agents, such as ellipticine, and alkylating agents or metal complexes may be attached to sense or 

35 antisense oligonucleotides to modify binding specificities of the antisense or sense oligonucleotide for the target 
nucleotide sequence. 
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Antisense or sense oligonucleotides may be introduced into a cell containing the target nucleic acid 
sequence by any gene transfer method, including, for example, CaP0 4 -mediated DNA transfection, 
electroporation, or by using gene transfer vectors such as Epstein-Barr virus. In a preferred procedure, an 
antisense or sense oligonucleotide is inserted into a suitable retroviral vector. A cell containing the target nucleic 
acid sequence is contacted with the recombinant retroviral vector, either in vivo or ex vivo. Suitable retroviral 
5 vectors include, but are not limited to, those derived from the murine retrovirus M-MuLV, N2 (a retrovirus 
derived from M-MuLV), or the double copy vectors designated DCT5A, DCT5B and DCT5C (see WO 
90/13641). 

Sense or antisense oligonucleotides also may be introduced into a cell containing the target nucleotide 
sequence by formation of a conjugate with a ligand binding molecule, as described in WO 91/04753. Suitable 

10 ligand binding molecules include, but are not limited to, cell surface receptors, growth factors, other cytokines, 
or other ligands that bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule does 
not substantially interfere with the ability of the ligand binding molecule to bind to its corresponding molecule 
or receptor, or block entry of the sense or antisense oligonucleotide or its conjugated version into the cell. 

Alternatively, a sense or an antisense oligonucleotide may be introduced into a cell containing the target 

15 nucleic acid sequence by formation of an oligonucleotide-lipid complex, as described in WO 90/10448. The 
sense or antisense oligonucleotide-lipid complex is preferably dissociated within the cell by an endogenous lipase. 

The probes may also be employed in PCR techniques to generate a pool of sequences for identification 
of closely related PRO coding sequences. 

Nucleotide sequences encoding a PRO can also be used to construct hybridization probes for mapping 

20 the gene which encodes that PRO and for the genetic analysis of individuals with genetic disorders. The 
nucleotide sequences provided herein may be mapped to a chromosome and specific regions of a chromosome 
using known techniques, such as in situ hybridization, linkage analysis against known chromosomal markers, 
and hybridization screening with libraries. 

When the coding sequences for PRO encode a protein which binds to another protein (example, where 

25 the PRO is a receptor), the PRO can be used in assays to identify the other proteins or molecules involved in the 
binding interaction. By such methods, inhibitors of the receptor/ligand binding interaction can be identified. 
Proteins involved in such binding interactions can also be used to screen for peptide or small molecule inhibitors 
or agonists of the binding interaction. Also, the receptor PRO can be used to isolate correlative ligand(s). 
Screening assays can be designed to find lead compounds that mimic the biological activity of a native PRO or 

30 a receptor for PRO. Such screening assays will include assays amenable to high-throughput screening of 
chemical libraries, making them particularly suitable for identifying small molecule drug candidates. Small 
molecules contemplated include synthetic organic or inorganic compounds. The assays can be performed in a 
variety of formats, including protein-protein binding assays, biochemical screening assays, immunoassays and 
cell based assays, which are well characterized in the art. 

35 Nucleic acids which encode PRO or its modified forms can also be used to generate either transgenic 

animals or "knock out" animals which, in turn, are useful in the development and screening of therapeutically 
useful reagents. A transgenic animal (e.g., a mouse or rat) is an animal having cells that contain a transgene, 

358 



WO 99/63088 



PCT/US99/12252 



which transgene was introduced into the animal or an ancestor of the animal at a prenatal, e.g., an embryonic 
stage. A transgene is a DNA which is integrated into the genome of a cell from which a transgenic animal 
develops. In one embodiment, cDNA encoding PRO can be used to clone genomic DNA encoding PRO in 
accordance with established techniques and the genomic sequences used to generate transgenic animals that 
contain cells which express DNA encoding PRO. Methods for generating transgenic animals, particularly 
5 animals such as mice or rats, have become conventional in the art and are described, for example, in U.S. Patent 
Nos. 4,736,866 and 4,870,009. Typically, particular cells would be targeted for PRO transgene incorporation 
with tissue-specific enhancers. Transgenic animals that include a copy of a transgene encoding PRO introduced 
into the germ line of the animal at an embryonic stage can be used to examine the effect of increased expression 
of DNA encoding PRO. Such animals can be used as tester animals for reagents thought to confer protection 
10 from, for example, pathological conditions associated with its overexpression. In accordance with this facet of 
the invention, an animal is treated with the reagent and a reduced incidence of the pathological condition, 
compared to untreated animals bearing the transgene, would indicate a potential therapeutic intervention for the 
pathological condition. 

Alternatively, non-human homologues of PRO can be used to construct a PRO "knock out" animal 

15 which has a defective or altered gene encoding PRO as a result of homologous recombination between the 
endogenous gene encoding PRO and altered genomic DNA encoding PRO introduced into an embryonic stem 
cell of the animal. For example, cDNA encoding PRO can be used to clone genomic DNA encoding PRO in 
accordance with established techniques. A portion of the genomic DNA encoding PRO can be deleted or 
replaced with another gene, such as a gene encoding a selectable marker which can be used to monitor 

20 integration. Typically, several kilobases of unaltered flanking DNA (both at the 5* and 3' ends) are included 
in the vector [see e.g., Thomas and Capecchi, Cdl, 51:503 (1987) for a description of homologous 
recombination vectors] . The vector is introduced into an embryonic stem cell line (e.g. , by electroporation) and 
cells in which the introduced DNA has homologously recombined with the endogenous DNA are selected [see 
e.g., Li et al., Cell, 69:915 (1992)]. The selected cells are then injected into a blastocyst of an animal (e.g., 

25 a mouse or rat) to form aggregation chimeras [see e.g., Bradley, in Teratocarcinomas and Embryonic Stem 
Cells: A Practical Approach, E. J. Robertson, ed. (IRL, Oxford, 1987), pp. 1 13-152]. A chimeric embryo can 
then be implanted into a suitable pseudopregnant female foster animal and the embryo brought to term to create 
a "knock out" animal. Progeny harboring the homologously recombined DNA in their germ cells can be 
identified by standard techniques and used to breed animals in which all cells of the animal contain the 

30 homologously recombined DNA. Knockout animals can be characterized for instance, for their ability to defend 
against certain pathological conditions and for their development of pathological conditions due to absence of 
the PRO polypeptide. 

Nucleic acid encoding the PRO polypeptides may also be used in gene therapy. In gene therapy 
applications, genes are introduced into cells in order to achieve in vivo synthesis of a therapeutically effective 
35 genetic product, for example for replacement of a defective gene. "Gene therapy" includes both conventional 
gene therapy where a lasting effect is achieved by a single treatment, and the administration of gene therapeutic 
agents, which involves the one time or repeated administration of a therapeutically effective DNA or mRNA. 
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Antisense RNAs and DNAs can be used as therapeutic agents for blocking the expression of certain genes in 
vivo. It has already been shown that short antisense oligonucleotides can be imported into cells where they act 
as inhibitors, despite their low intracellular concentrations caused by their restricted uptake by the cell 
membrane. (Zamecnik et al. , Proc. Natl. Acad. Sci. USA 83:4143-4146 [1986]). The oligonucleotides can be 
modified to enhance their uptake, e.g. by substituting their negatively charged phosphodiester groups by 
5 uncharged groups. 

There are a variety of techniques available for introducing nucleic acids into viable cells. The 
techniques vary depending upon whether the nucleic acid is transferred into cultured cells in vitro, or in vivo in 
the cells of the intended host. Techniques suitable for the transfer of nucleic acid into mammalian cells in vitro 
include the use of liposomes, electroporation, microinjection, cell fusion, DEAE-dextran, the calcium phosphate 

10 precipitation method, etc. The currently preferred in vivo gene transfer techniques include transfection with viral 
(typically retroviral) vectors and viral coat protein-liposome mediated transfection (Dzau et al., Trends in 
Biotechnology 11. 205-210 [1993]). In some situations it is desirable to provide the nucleic acid source with 
an agent that targets the target cells, such as an antibody specific for a cell surface membrane protein or the 
target cell, a ligand for a receptor on the target cell, etc. Where liposomes are employed, proteins which bind 

15 to a cell surface membrane protein associated with endocytosis may be used for targeting and/or to facilitate 
uptake, e.g. capsid proteins or fragments thereof tropic for a particular cell type, antibodies for proteins which 
undergo internalization in cycling, proteins that target intracellular localization and enhance intracellular half-life. 
The technique of receptor-mediated endocytosis is described, for example, by Wu et al., J. Biol. Chem. 262. 
4429-4432 (1987); and Wagner et al., Proc. Natl. Acad. Sci. USA 87, 3410-3414 (1990). For review of gene 

20 marking and gene therapy protocols see Anderson et al., Science 256. 808-813 (1992). 

The PRO polypeptides described herein may also be employed as molecular weight markers for protein 
electrophoresis purposes. 

The nucleic acid molecules encoding the PRO polypeptides or fragments thereof described herein are 
useful for chromosome identification. In this regard, there exists an ongoing need to identify new chromosome 

25 markers, since relatively few chromosome marking reagents, based upon actual sequence data are presently 
available. Each PRO nucleic acid molecule of the present invention can be used as a chromosome marker. 

The PRO polypeptides and nucleic acid molecules of the present invention may also be used for tissue 
typing, wherein the PRO polypeptides of the present invention may be differentially expressed in one tissue as 
compared to another. PRO nucleic acid molecules will find use for generating probes for PCR, Northern 

30 analysis, Southern analysis and Western analysis. 

The PRO polypeptides described herein may also be employed as therapeutic agents. The PRO 
polypeptides of the present invention can be formulated according to known methods to prepare pharmaceutical^ 
useful compositions, whereby the PRO product hereof is combined in admixture with a pharmaceutically 
acceptable carrier vehicle. Therapeutic formulations are prepared for storage by mixing the active ingredient 

35 having the desired degree of purity with optional physiologically acceptable carriers, excipients or stabilizers 
(Remington's Pharmaceutical Sciences 16th edition, Osol, A. Ed. (1980)), in the form of lyophilized 
formulations or aqueous solutions. Acceptable carriers, excipients or stabilizers are nontoxic to recipients at the 
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dosages and concentrations employed, and include buffers such as phosphate, citrate and other organic acids; 
antioxidants including ascorbic acid; low molecular weight (less than about 10 residues) polypeptides; proteins, 
such as serum albumin, gelatin or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone, amino 
acids such as glycine, glutamine, asparagine, arginine or lysine; monosaccharides, disaccharides and other 
carbohydrates including glucose, mannose, or dextrins; chelating agents such as EDTA; sugar alcohols such as 
5 mannitol or sorbitol; salt-forming counterions such as sodium; and/or nonionic surfactants such as TWEEN™, 
PLURONICS™ or PEG. 

The formulations to be used for In vivo administration must be sterile. This is readily accomplished by 
filtration through sterile filtration membranes, prior to or following lyophilization and reconstitution. 

Therapeutic compositions herein generally are placed into a container having a sterile access port, for 

10 example, an intravenous solution bag or vial having a stopper pierceable by a hypodermic injection needle. 

The route of administration is in accord with known methods, e.g. injection or infusion by intravenous, 
intraperitoneal, intracerebral, intramuscular, intraocular, intraarterial or intralesional routes, topical 
administration, or by sustained release systems. 

Dosages and desired drug concentrations of pharmaceutical compositions of the present invention may 

15 vary depending on the particular use envisioned. The determination of the appropriate dosage or route of 
administration is well within the skill of an ordinary physician. Animal experiments provide reliable guidance 
for the determination of effective doses for human therapy. Interspecies scaling of effective doses can be 
performed following the principles laid down by Mordenti, J. and Chappell, W. "The use of interspecies scaling 
in toxicokinetics" In Toxicokinetics and New Drug Development, Yacobi et al., Eds., Pergamon Press, New 

20 York 1989, pp. 42-96. 

When in vivo administration of a PRO polypeptide or agonist or antagonist thereof is employed, normal 
dosage amounts may vary from about 10 ng/kg to up to 100 mg/kg of mammal body weight or more per day, 
preferably about 1 /xg/kg/day to 10 mg/kg/day, depending upon the route of administration. Guidance as to 
particular dosages and methods of delivery is provided in the literature; see, for example, U.S. Pat. Nos. 

25 4,657,760; 5,206,344; or 5,225,212. It is anticipated that different formulations will be effective for different 
treatment compounds and different disorders, that administration targeting one organ or tissue, for example, may 
necessitate delivery in a manner different from that to another organ or tissue. 

Where sustained-release administration of a PRO polypeptide is desired in a formulation with release 
characteristics suitable for the treatment of any disease or disorder requiring administration of the PRO 

30 polypeptide, microencapsulation of the PRO polypeptide is contemplated. Microencapsulation of recombinant 
proteins for sustained release has been successfully performed with human growth hormone (rhGH), interferon- 
(rhlFN- ), interleukin-2, and MN rgpi20. Johnson et al., Nat. Med. . 2:795-799 (1996); Yasuda, Biomed. 
Ther. . 27:1221-1223 (1993): Horaet al.. Bio/Technology. 8:755-758 (1990): Cleland, "Design and Production 
of Single Immunization Vaccines Using Polylactide Polyglycolide Microsphere Systems," in Vaccine Design: 

35 The Subunit and Adjuvant Approach. Powell and Newman, eds, (Plenum Press: New York, 1 995), pp. 439-462; 
WO 97/03692. WO 96/40072, WO 96/07399; and U.S. Pat. No. 5,654,010. 
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The sustained-release formulations of these proteins were developed using poly-lactic-coglycolic acid 
(PLGA) polymer due to its biocompatibility and wide range of biodegradable properties. The degradation 
products of PLGA, lactic and glycolic acids, can be cleared quickly within the human body. Moreover, the 
degradability of this polymer can be adjusted from months to years depending on its molecular weight and 
composition. Lewis, "Controlled release of bioactive agents from lactide/glycolide polymer, " in: M. Chasin and 
5 R. Langer (Eds.), Biodegradable Polymers as Drug Delivery Systems (Marcel Dekker: New York, 1990), pp. 
1-41. 

This invention encompasses methods of screening compounds to identify those that mimic the PRO 
polypeptide (agonists) or prevent the effect of the PRO polypeptide (antagonists). Screening assays for 
antagonist drug candidates are designed to identify compounds that bind or complex with the PRO polypeptides 

10 encoded by the genes identified herein, or otherwise interfere with the interaction of the encoded polypeptides 
with other cellular proteins. Such screening assays will include assays amenable to high-throughput screening 
of chemical libraries, making them particularly suitable for identifying small molecule drug candidates. 

The assays can be performed in a variety of formats, including protein-protein binding assays, 
biochemical screening assays, immunoassays, and cell-based assays, which are well characterized in the art. 

15 All assays for antagonists are common in that they call for contacting the drug candidate with a PRO 

polypeptide encoded by a nucleic acid identified herein under conditions and for a time sufficient to allow these 
two components to interact. 

In binding assays, the interaction is binding and the complex formed can be isolated or detected in the 
reaction mixture. In a particular embodiment, the PRO polypeptide encoded by the gene identified herein or the 

20 drug candidate is immobilized on a solid phase, e.g., on a microtiter plate, by covalent or non-covalent 
attachments. Non-covalent attachment generally is accomplished by coating the solid surface with a solution of 
the PRO polypeptide and drying. Alternatively, an immobilized antibody, e.g. , a monoclonal antibody, specific 
for the PRO polypeptide to be immobilized can be used to anchor it to a solid surface. The assay is performed 
by adding the non-immobilized component, which may be labeled by a detectable label, to the immobilized 

25 component, e.g., the coated surface containing the anchored component. When the reaction is complete, the 
non-reacted components are removed, e.g., by washing, and complexes anchored on the solid surface are 
detected. When the originally non-immobilized component carries a detectable label, the detection of label 
immobilized on the surface indicates that complexing occurred. Where the originally non-immobilized 
component does not carry a label, complexing can be detected, for example, by using a labeled antibody 

30 specifically binding the immobilized complex. 

If the candidate compound interacts with but does not bind to a particular PRO polypeptide encoded by 
a gene identified herein, its interaction with that polypeptide can be assayed by methods well known for detecting 
protein-protein interactions. Such assays include traditional approaches, such as, e.g., cross-linking, co- 
immunoprecipitation, and co-purification through gradients or chromatographic columns. In addition, protein- 

35 protein interactions can be monitored by using a yeast-based genetic system described by Fields and co-workers 
(Fields and Song, Nature (London), 340:245-246 (1989); Chien et ah, Proc. Natl. Acad. Sci. USA . 88:9578- 
9582 (1991)) as disclosed by Chevray and Nathans, Proc. Natl. Acad. Sci. USA . 89: 5789-5793 (1991). Many 
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transcriptional activators, such as yeast GAL4, consist of two physically discrete modular domains, one acting 
as the DNA-binding domain, the other one functioning as the transcription-activation domain. The yeast 
expression system described in the foregoing publications (generally referred to as the "two-hybrid system") 
takes advantage of this property, and employs two hybrid proteins, one in which the target protein is fused to 
the DNA-binding domain of GAM, and another, in which candidate activating proteins are fused to the 
5 activation domain. The expression of a GALl-tocZ reporter gene under control of a GAL4-activated promoter 
depends on reconstitution of GALA activity via protein-protein interaction. Colonies containing interacting 
polypeptides are detected with a chromogenic substrate for P-galactosidase. A complete kit 
(MATCHMAKER™) for identifying protein-protein interactions between two specific proteins using the two- 
hybrid technique is commercially available from Clontech. This system can also be extended to map protein 
10 domains involved in specific protein interactions as well as to pinpoint amino acid residues that are crucial for 
these interactions. 

Compounds that interfere with the interaction of a gene encoding a PRO polypeptide identified herein 
and other intra- or extracellular components can be tested as follows: usually a reaction mixture is prepared 
containing the product of the gene and the intra- or extracellular component under conditions and for a time 

15 allowing for the interaction and binding of the two products. To test the ability of a candidate compound to 
inhibit binding, the reaction is run in the absence and in the presence of the test compound. In addition, a 
placebo may be added to a third reaction mixture, to serve as positive control. The binding (complex formation) 
between the test compound and the intra- or extracellular component present in the mixture is monitored as 
described hereinabove. The formation of a complex in the control reaction(s) but not in the reaction mixture 

20 containing the test compound indicates that the test compound interferes with the interaction of the test compound 
and its reaction partner. 

To assay for antagonists, the PRO polypeptide may be added to a cell along with the compound to be 
screened for a particular activity and the ability of the compound to inhibit the activity of interest in the presence 
of the PRO polypeptide indicates that the compound is an antagonist to the PRO polypeptide. Alternatively, 

25 antagonists may be detected by combining the PRO polypeptide and a potential antagonist with membrane-bound 
PRO polypeptide receptors or recombinant receptors under appropriate conditions for a competitive inhibition 
assay. The PRO polypeptide can be labeled, such as by radioactivity, such that the number of PRO polypeptide 
molecules bound to the receptor can be used to determine the effectiveness of the potential antagonist. The gene 
encoding the receptor can be identified by numerous methods known to those of skill in the art, for example, 

30 ligand panning and FACS sorting. Coligan et al., Current Protocols in Immun.. 1(2): Chapter 5 (1991). 
Preferably, expression cloning is employed wherein polyadenylated RNA is prepared from a cell responsive to 
the PRO polypeptide and a cDNA library created from this RNA is divided into pools and used to transfect COS 
cells or other cells that are not responsive to the PRO polypeptide. Transfected cells that are grown on glass 
slides are exposed to labeled PRO polypeptide. The PRO polypeptide can be labeled by a variety of means 

35 including iodination or inclusion of a recognition site for a site-specific protein kinase. Following fixation and 
incubation, the slides are subjected to autoradiographic analysis. Positive pools are identified and sub-pools are 
prepared and re-transfected using an interactive sub-pooling and re-screening process, eventually yielding a 
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single clone that encodes the putative receptor. 

As an alternative approach for receptor identification, labeled PRO polypeptide can be photoaffinity- 
linked with cell membrane or extract preparations that express the receptor molecule. Cross-linked material is 
resolved by PAGE and exposed to X-ray film. The labeled complex containing the receptor can be excised, 
resolved into peptide fragments, and subjected to protein micro-sequencing. The amino acid sequence obtained 
5 from micro- sequencing would be used to design a set of degenerate oligonucleotide probes to screen a cDNA 
library to identify the gene encoding the putative receptor. 

In another assay for antagonists, mammalian cells or a membrane preparation expressing the receptor 
would be incubated with labeled PRO polypeptide in the presence of the candidate compound. The ability of 
the compound to enhance or block this interaction could then be measured. 

10 More specific examples of potential antagonists include an oligonucleotide that binds to the fusions of 

immunoglobulin with PRO polypeptide, and, in particular, antibodies including, without limitation, poly- and 
monoclonal antibodies and antibody fragments, single-chain antibodies, anti-idiotypic antibodies, and chimeric 
or humanized versions of such antibodies or fragments, as well as human antibodies and antibody fragments. 
Alternatively, a potential antagonist may be a closely related protein, for example, a mutated form of the PRO 

15 polypeptide that recognizes the receptor but imparts no effect, thereby competitively inhibiting the action of the 
PRO polypeptide. 

Another potential PRO polypeptide antagonist is an antisense RNA or DNA construct prepared using 
antisense technology, where, e.g., an antisense RNA or DNA molecule acts to block directly the translation of 
mRNA by hybridizing to targeted mRNA and preventing protein translation. Antisense technology can be used 

20 to control gene expression through triple-helix formation or antisense DNA or RNA, both of which methods are 
based on binding of a polynucleotide to DNA or RNA. For example, the 5' coding portion of the polynucleotide 
sequence, which encodes the mature PRO polypeptides herein, is used to design an antisense RNA 
oligonucleotide of from about 10 to 40 base pairs in length. A DNA oligonucleotide is designed to be 
complementary to a region of the gene involved in transcription (triple helix - see Lee et al., Nucl. Acids Res.. 

25 6:3073 (1979); Cooney et al., Science . 241: 456 (1988); Dervan et al., Science . 251:1360 (1991)), thereby 
preventing transcription and the production of the PRO polypeptide. The antisense RNA oligonucleotide 
hybridizes to the mRNA in vivo and blocks translation of the mRNA molecule into the PRO polypeptide 
(antisense - Okano, Neurochem. . 56:560 (1991); Oligodeoxvnucleotides as Antisense Inhibitors of Gene 
Expression (CRC Press: Boca Raton, FL, 1988). The oligonucleotides described above can also be delivered 

30 to cells such that the antisense RNA or DNA may be expressed in vivo to inhibit production of the PRO 
polypeptide. When antisense DNA is used, oligodeoxyribonucleotides derived from the translation-initiationsite, 
e.g., between about -10 and +10 positions of the target gene nucleotide sequence, are preferred. 

Potential antagonists include small molecules that bind to the active site, the receptor binding site, or 
growth factor or other relevant binding site of the PRO polypeptide, thereby blocking the normal biological 

35 activity of the PRO polypeptide. Examples of small molecules include, but are not limited to, small peptides 
or peptide-like molecules, preferably soluble peptides, and synthetic non-peptidyl organic or inorganic 
compounds. 
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Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of RNA. 
Ribozymes act by sequence-specifichybridization to the complementary target RN A, followed by endonucleolytic 
cleavage. Specific ribozyme cleavage sites within a potential RNA target can be identified by known techniques. 
For further details see, e.g., Rossi, Current Biology. 4:469-471 (1994), and PCT publication No. W0 97/33551 
(published September 18, 1997). 
5 Nucleic acid molecules in triple-helix formation used to inhibit transcription should be single-stranded 

and composed of deoxynucleotides. The base composition of these oligonucleotides is designed such that it 
promotes triple-helix formation via Hoogsteen base-pairing rules, which generally require sizeable stretches of 
purines or pyrimidines on one strand of a duplex. For further details see, e.g., PCT publication No. WO 
97/33551, supra. 

10 These small molecules can be identified by any one or more of the screening assays discussed 

hereinabove and/or by any other screening techniques well known for those skilled in the an. 

PR0189 can be used in assays with W01A6.1 of C. Elegans, phosphodiesterases, transporters and 
proteins which bind to fatty acids, to determine the relative activities of PR0189 against these proteins. The 
results can be applied accordingly. 

15 

F. Anti-PRO Antibodies 
The present invention further provides anti-PRO antibodies. Exemplary antibodies include polyclonal, 
monoclonal, humanized, bispecific, and heteroconjugate antibodies. 

20 1. Polyclonal Antibodies 

The anti-PRO antibodies may comprise polyclonal antibodies. Methods of preparing polyclonal 
antibodies are known to the skilled artisan. Polyclonal antibodies can be raised in a mammal, for example, by 
one or more injections of an immunizing agent and, if desired, an adjuvant. Typically, the immunizing agent 
and/or adjuvant will be injected in the mammal by multiple subcutaneous or intraperitoneal injections. The 

25 immunizing agent may include the PRO polypeptide or a fusion protein thereof. It may be useful to conjugate 
the immunizing agent to a protein known to be immunogenic in the mammal being immunized. Examples of 
such immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum albumin, bovine 
thyroglobulin, and soybean trypsin inhibitor. Examples of adjuvants which may be employed include Freund's 
complete adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose dicorynomycolate). 

30 The immunization protocol may be selected by one skilled in the an without undue experimentation. 

2. Monoclonal Antibodies 
The anti-PRO antibodies may, alternatively, be monoclonal antibodies. Monoclonal antibodies may be 
prepared using hybridoma methods, such as those described by Kohler and Milstein, Nature . 256:495 (1975). 
35 In a hybridoma method, a mouse, hamster, or other appropriate host animal, is typically immunized with an 
immunizing agent to elicit lymphocytes that produce or are capable of producing antibodies that will specifically 
bind to the immunizing agent. Alternatively, the lymphocytes may be immunized in vitro. 
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The immunizing agent will typically include the PRO polypeptide or a fusion protein thereof. 
Generally, either peripheral blood lymphocytes ("PBLs") are used if cells of human origin are desired, or spleen 
cells or lymph node cells are used if non-human mammalian sources are desired. The lymphocytes are then 
fused with an immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form a 
hybridoma cell [Goding, Monoclonal Antibodies: Principles and Practice. Academic Press, (1986) pp. 59-103]. 
5 Immortalized cell lines are usually transformed mammalian cells, particularly myeloma cells of rodent, bovine 
and human origin. Usually, rat or mouse myeloma cell lines are employed. The hybridoma cells may be 
cultured in a suitable culture medium that preferably contains one or more substances that inhibit the growth or 
survival of the unfused, immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine 
guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the hybridomas typically will 
10 include hypoxanthine, aminopterin, and thymidine ("HAT medium"), which substances prevent the growth of 
HGPRT-deficient cells. 

Preferred immortalized cell lines are those that fuse efficiently, support stable high level expression of 
antibody by the selected antibody-producing cells, and are sensitive to a medium such as HAT medium. More 
preferred immortalized cell lines are murine myeloma lines, which can be obtained, for instance, from the Salk 

15 Institute Cell Distribution Center, San Diego, California and the American Type Culture Collection, Manassas, 
Virginia. Human myeloma and mouse-human heteromyeloma ceil lines also have been described for the 
production of human monoclonal antibodies [Kozbor, J. Immunol. . 133:3001 (1984); Brodeuretal., Monoclonal 
Antibody Production Techniques and Applications. Marcel Dekker, Inc., New York, (1987) pp. 51-63]. 

The culture medium in which the hybridoma cells are cultured can then be assayed for the presence of 

20 monoclonal antibodies directed against PRO. Preferably, the binding specificity of monoclonal antibodies 
produced by the hybridoma cells is determined by immunoprecipitation or by an in vitro binding assay, such as 
radioimmunoassay (RIA) or enzyme-linked immunoabsorbent assay (ELISA). Such techniques and assays are 
known in the art. The binding affinity of the monoclonal antibody can, for example, be determined by the 
Scatchard analysis of Munson and Pollard, Anal. Biochem.. 107:220 (1980). 

25 After the desired hybridoma cells are identified, the clones may be subcloned by limiting dilution 

procedures and grown by standard methods [Goding, supra]. Suitable culture media for this purpose include, 
for example, Dulbecco's Modified Eagle's Medium and RPMI- 1640 medium. Alternatively, the hybridoma cells 
may be grown in vivo as ascites in a mammal. 

The monoclonal antibodies secreted by the subclones may be isolated or purified from the culture 

30 medium or ascites fluid by conventional immunoglobulin purification procedures such as, for example, protein 
A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, dialysis, or affinity chromatography. 

The monoclonal antibodies may also be made by recombinant DNA methods, such as those described 
in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of the invention can be readily isolated 
and sequenced using conventional procedures (e.g. , by using oligonucleotide probes that are capable of binding 

35 specifically to genes encoding the heavy and light chains of murine antibodies). The hybridoma cells of the 
invention serve as a preferred source of such DNA. Once isolated, the DNA may be placed into expression 
vectors, which are men transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, 
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or myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis of monoclonal 
antibodies in the recombinant host cells. The DNA also may be modified, for example, by substituting the 
coding sequence for human heavy and light chain constant domains in place of the homologous murine sequences 
[U.S. Patent No. 4,816,567; Morrison et al., supral or by covalendy joining to the immunoglobulin coding 
sequence all or part of the coding sequence for a non-immunoglobulin polypeptide. Such a non-immunoglobulin 
5 polypeptide can be substituted for the constant domains of an antibody of the invention, or can be substituted for 
the variable domains of one antigen-combining site of an antibody of the invention to create a chimeric bivalent 
antibody. 

The antibodies may be monovalent antibodies. Methods for preparing monovalent antibodies are well 
known in the art. For example, one method involves recombinant expression of immunoglobulin light chain and 
10 modified heavy chain. The heavy chain is truncated generally at any point in the Fc region so as to prevent 
heavy chain crosslinking. Alternatively, the relevant cysteine residues are substituted with another amino acid 
residue or are deleted so as to prevent crosslinking. 

In vitro methods are also suitable for preparing monovalent antibodies. Digestion of antibodies to 
produce fragments thereof, particularly, Fab fragments, can be accomplished using routine techniques known 
15 in the art. 

3. Human and Humanized Antibodies 
The anti-PRO antibodies of the invention may further comprise humanized antibodies or human 
antibodies. Humanized forms of non-human (e.g., murine) antibodies are chimeric immunoglobulins, 

20 immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab') 2 or other antigen-binding 
subsequences of antibodies) which contain minimal sequence derived from non-human immunoglobulin. 
Humanized antibodies include human immunoglobulins (recipient antibody) in which residues from a 
complementary determining region (CDR) of the recipient are replaced by residues from a CDR of a non-human 
species (donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and capacity. In 

25 some instances , Fv framework residues of the human immunoglobulin are replaced by corresponding non-human 
residues. Humanized antibodies may also comprise residues which are found neither in the recipient antibody 
nor in the imported CDR or framework sequences. In general, the humanized antibody will comprise 
substantially all of at least one, and typically two, variable domains, in which all or substantially all of the CDR 
regions correspond to those of a non-human immunoglobulin and all or substantially all of the FR regions are 

30 those of a human immunoglobulin consensus sequence. The humanized antibody optimally also will comprise 
at least a portion of an immunoglobulin constant region (Fc), typically that of a human immunoglobulin [Jones 
etal.. Nature. 321:522-525 (1986): Riechmannetal.. Nature, 332:323-329(1988); and Presta, Curr. Op. Struct. 
Biol. . 2:593-596 (1992)]. 

Methods for humanizing non-human antibodies are well known in the an. Generally, a humanized 

35 antibody has one or more amino acid residues introduced into it from a source which is non-human. These non- 
human amino acid residues are often referred to as "import" residues, which are typically taken from an "import " 
variable domain. Humanization can be essentially performed following the method of Winter and co-workers 
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[Jones et al., Nature . 321:522-525 (1986); Riechmann et al., Nature , 332:323-327 (1988); Verhoeyen et al., 
Science . 239:1534-1536 (1988)], by substituting rodent CDRs or CDR sequences for the corresponding 
sequences of a human antibody. Accordingly, such "humanized'' antibodies are chimeric antibodies (U.S. Patent 
No. 4,816,567), wherein substantially less than an intact human variable domain has been substituted by the 
corresponding sequence from a non-human species. In practice, humanized antibodies are typically human 
5 antibodies in which some CDR residues and possibly some FR residues are substituted by residues from 
analogous sites in rodent antibodies. 

Human antibodies can also be produced using various techniques known in the art, including phage 
display libraries [Hoogenboom and Winter, J. Mol.Biol.. 227:381 (1991); Marks et al., J. Mol. Biol.. 222:581 
(1991)]. The techniques of Cole et al. and Boerner et al. are also available for the preparation of human 

10 monoclonal antibodies (Cole et al. , Monoclonal Antibodies and C ancer Therapy. Alan R. Liss, p. 77 (1985) and 
Boerner et al., J. Immunol.. 147(l) :86-95 (1991)]. Similarly, human antibodies can be made by introducing 
of human immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous immunoglobulin 
genes have been partially or completely inactivated. Upon challenge, human antibody production is observed, 
which closely resembles that seen in humans in all respects, including gene rearrangement, assembly, and 

15 antibody repertoire. This approach is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 
5,569,825; 5,625,126; 5,633,425; 5,661,016, and in the following scientific publications: Marks et al, 
Bio/Technology 10. 779-783 (1992): Lonberg etal. Nature 368 856-859(1994): Morrison, Nature 368 .812-13 
(1994); Fishwild et aL, Nature Biotechnology 14. 845-51 (1996); Neuberger, Nature Biotechnology 14. 826 
(1996); Lonberg and Huszar, Intern. Rev. Immunol. 13 65-93 (1995). 

20 

4. Bispecific Antibodies 
Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that have binding 
specificities for at least two different antigens. In the present case, one of the binding specificities is for the 
PRO, the other one is for any other antigen, and preferably for a cell-surface protein or receptor or receptor 
25 subunit. 

Methods for making bispecific antibodies are known in the art. Traditionally, the recombinant 
production of bispecific antibodies is based on the co-expression of two immunoglobulin heavy-chain/light-chain 
pairs, where the two heavy chains have different specificities [Milstein and Cuello, Nature . 305:537-539 (1983)] . 
Because of the random assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) 
30 produce a potential mixture of ten different antibody molecules, of which only one has the correct bispecific 
structure. The purification of the correct molecule is usually accomplished by affinity chromatography steps. 
Similar procedures are disclosed in WO 93/08829, published 13 May 1993, and in Traunecker et al., EMBO 
L, 10:3655-3659 (1991). 

Antibody variable domains with the desired binding specificities (antibody-antigen combining sites) can 
35 be fused to immunoglobulin constant domain sequences. The fusion preferably is with an immunoglobulin 
heavy-chain constant domain, comprising at least pan of the hinge, CH2, and CH3 regions. It is preferred to 
have the first heavy-chain constant region (CH 1) containing the site necessary for light-chain binding present in 



368 



WO 99/63088 



PCT/US99/12252 



at least one of the fusions. DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the 
immunoglobulin light chain, are inserted into separate expression vectors, and are co-transfected into a suitable 
host organism. For further details of generating bispecific antibodies see, for example, Suresh et al. , Methods 
in Enzvmology. 121:210 (1986). 

According to another approach described in WO 96/27011, the interface between a pair of antibody 
5 molecules can be engineered to maximize the percentage of heterodimers which are recovered from recombinant 
cell culture. The preferred interface comprises at least a part of the CH3 region of an antibody constant domain. 
In this method, one or more small amino acid side chains from the interface of the first antibody molecule are 
replaced with larger side chains (e.g. tyrosine or tryptophan). Compensatory "cavities" of identical or similar 
size to the large side chain(s) are created on the interface of the second antibody molecule by replacing large 

10 amino acid side chains with smaller ones (e.g. alanine or threonine). This provides a mechanism for increasing 
the yield of the heterodimer over other unwanted end-products such as homodimers. 

Bispecific antibodies can be prepared as full length antibodies or antibody fragments (e.g. F(ab') 2 
bispecific antibodies). Techniques for generating bispecific antibodies from antibody fragments have been 
described in the literature. For example, bispecific antibodies can be prepared can be prepared using chemical 

15 linkage. Brennan et al. , Science 229:81 (1985) describe a procedure wherein intact antibodies are proteolytically 
cleaved to generate F(ab') 2 fragments. These fragments are reduced in the presence of the dithiol complexing 
agent sodium arsenite to stabilize vicinal dithiols and prevent intermolecular disulfide formation. The Fab* 
fragments generated are then converted to thionitrobenzoate (TNB) derivatives. One of the Fab'-TNB 
derivatives is then reconverted to the Fab'-thiol by reduction with mercaptoethylamine and is mixed with an 

20 equimolar amount of the other Fab'-TNB derivative to form the bispecific antibody. The bispecific antibodies 
produced can be used as agents for the selective immobilization of enzymes. 

Fab' fragments may be directly recovered from E. coli and chemically coupled to form bispecific 
antibodies. Shalaby et al. t J. Exp. Med. 175:217-225 (1992) describe the production of a fully humanized 
bispecific antibody F(ab') 2 molecule. Each Fab' fragment was separately secreted from E. coli and subjected 

25 to directed chemical coupling in vitro to form the bispecific antibody. The bispecific antibody thus formed was 
able to bind to cells overexpressing the ErbB2 receptor and normal human T cells, as well as trigger the lytic 
activity of human cytotoxic lymphocytes against human breast tumor targets. 

Various technique for making and isolating bispecific antibody fragments directly from recombinant cell 
culture have also been described. For example, bispecific antibodies have been produced using leucine zippers. 

30 Kostelny et al,, J. Immunol. 148(5): 1547-1553 (1992). The leucine zipper peptides from the Fos and Jun 
proteins were linked to the Fab' portions of two different antibodies by gene fusion. The antibody homodimers 
were reduced at the hinge region to form monomers and then re-oxidized to form the antibody heterodimers. 
This method can also be utilized for the production of antibody homodimers. The "diabody" technology 
described by Hollinger et aL, Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993) has provided an alternative 

35 mechanism for making bispecific antibody fragments. The fragments comprise a heavy-chain variable domain 
(V H ) connected to a light-chain variable domain (V L ) by a linker which is too short to allow pairing between the 
two domains on the same chain. Accordingly, the V H and V L domains of one fragment are forced to pair with 
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the complementary V L and V H domains of another fragment, thereby forming two antigen-binding sites. Another 
strategy for making bispecific antibody fragments by the use of single-chain Fv (sFv) dimers has also been 
reported. See, Gruber et al. , J. Immunol. 152:5368 (1994). 

Antibodies with more than two valencies are contemplated. For example, trispecific antibodies can be prepared. 
Tutt et ai, J. Immunol. 147:60 (1991). 
5 Exemplary bispecific antibodies may bind to two different epitopes on a given PRO polypeptide herein. 

Alternatively, an anti-PRO polypeptide arm may be combined with an arm which binds to a triggering molecule 
on a leukocyte such as a T-ceil receptor molecule (e.g. CD2, CD3, CD28, or B7), or Fc receptors for IgG 
(FcyR), such as FcyRI (CD64), FcyRH (CD32) and FcyRIII (CD 16) so as to focus cellular defense mechanisms 
to the cell expressing the particular PRO polypeptide. Bispecific antibodies may also be used to localize 
10 cytotoxic agents to cells which express a particular PRO polypeptide. These antibodies possess a PRO-binding 
arm and an arm which binds a cytotoxic agent or a radionuclide chelator, such as EOTUBE, DPTA, DOTA, 
or TETA. Another bispecific antibody of interest binds the PRO polypeptide and further binds tissue factor 
(TF). 

15 5. Heteroconjugate Antibodies 

Heteroconjugate antibodies are also within the scope of the present invention. Heteroconjugate 
antibodies are composed of two covalently joined antibodies. Such antibodies have, for example, been proposed 
to target immune system cells to unwanted cells [U.S. Patent No. 4,676,980] , and for treatment of HIV infection 
[WO 91/00360; WO 92/200373; EP 03089]. It is contemplated that the antibodies may be prepared in vitro 

20 using known methods in synthetic protein chemistry, including those involving crosslinking agents. For 
example, immunotoxins may be constructed using a disulfide exchange reaction or by forming a thioether bond. 
Examples of suitable reagents for this purpose include iminothiolate and methyl-4-mercaptobutyrimidate and 
those disclosed, for example, in U.S. Patent No. 4,676,980. 

25 6. Effector Function Engineering 

It may be desirable to modify the antibody of the invention with respect to effector function, so as to 
enhance, e.g., the effectiveness of the antibody in treating cancer. For example, cysteine residue(s) may be 
introduced into the Fc region, thereby allowing interchain disulfide bond formation in this region. The 
homodimeric antibody thus generated may have improved internalization capability and/or increased 

30 complement-mediated cell killing and antibody-dependent cellular cytotoxicity (ADCC). See Caron et aL, L 
Exp Med .. 176: 1191-1195 (1992) and Shopes, J. Immunol .. 148 : 2918-2922 (1992). Homodimeric antibodies 
with enhanced anti-tumor activity may also be prepared using heterobifunctional cross-linkers as described in 
Wolff etal. Cancer Research. 53: 2560-2565 (1993). Alternatively, an antibody can be engineered that has dual 
Fc regions and may thereby have enhanced complement lysis and ADCC capabilities. See Stevenson et ai, Anti- 

35 Cancer Drug Design. 3: 219-230 (1989). 
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7. 



Immunoconiugates 



The invention also pertains to immunoconjugates comprising an antibody conjugated to a cytotoxic agent 
such as a chemotherapeutic agent, toxin (e.g. , an enzymaticaliy active toxin of bacterial, fungal, plant, or animal 
origin, or fragments thereof), or a radioactive isotope (i.e., a radioconjugate). 

Chemotherapeutic agents useful in the generation of such immunoconjugates have been described above. 
5 Enzymaticaliy active toxins and fragments thereof that can be used include diphtheria A chain, nonbinding active 
fragments of diphtheria toxin, exotoxin A chain (from Pseudomonas aeruginosa), ricin A chain, abrin A chain, 
modeccin A chain, alpha-sarcin, Aleurites fordii proteins, dianthin proteins, Phytolaca americana proteins 
(PAPI, PAPII, and PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria officinalis inhibitor, 
gelonin, mitogellin, restrictocin, phenomycin, enomycin, and the tricothecenes. A variety of radionuclides are 

10 available for the production of radioconjugated antibodies. Examples include 2l2 Bi, m I, l3, In, ^Y, and ,86 Re. 

Conjugates of the antibody and cytotoxic agent are made using a variety of bifunctional protein-coupling 
agents such as N-succimmidyl-3-(2-pyridyldithiol) propionate (SPDP), iminothiolane (IT), bifunctional 
derivatives of imidoesters (such as dimethyl adipimidate HCL), active esters (such as disuccinimidyl suberate), 
aldehydes (such as glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) hexanediamine), bis- 

15 diazonium derivatives (such as bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates (such as tolyene 2,6- 
diisocyanate), and bis-active fluorine compounds (such as 1 ,5-difluoro-2,4-dinitrobenzene). For example, a ricin 
immunotoxin can be prepared as described in Vitetta et al., Science , 238 : 1098 (1987). Carbon- 14-labeled 1- 
isothiocyanatobenzyl-3-methyldiethylene triaminepentaacetic acid (MX-DTPA) is an exemplary chelating agent 
for conjugation of radionucleotide to the antibody. See W094/1 1026. 

20 In another embodiment, the antibody may be conjugated to a "receptor 1 * (such streptavidin) for 

utilization in tumor pretargeting wherein the antibody-receptor conjugate is administered to the patient, followed 
by removal of unbound conjugate from the circulation using a clearing agent and then administration of a 
"ligand" (e.g., avidin) that is conjugated to a cytotoxic agent (e.g., a radionucleotide). 



The antibodies disclosed herein may also be formulated as immunoliposomes. Liposomes containing 
the antibody are prepared by methods known in the art, such as described in Epstein et ai, Proc. Natl. Acad. 
Sri. USA . 82: 3688 (1985); Hwang etal. t Proc. Natl Acad. Sci. USA . 77: 4030 (1980); and U.S. Pat. Nos. 
4,485,045 and 4,544,545. Liposomes with enhanced circulation time are disclosed in U.S. Patent No. 
30 5,013,556. 

Particularly useful liposomes can be generated by the reverse-phase evaporation method with a lipid 
composition comprising phosphatidylcholine, cholesterol, and PEG-derivatized phosphaudylethanolamine (PEG- 
PE). Liposomes are extruded through filters of defined pore size to yield liposomes with the desired diameter. 
Fab' fragments of the antibody of the present invention can be conjugated to the liposomes as described in Martin 
35 et al .. J. Biol. Chem. . 257: 286-288 (1982) via a disulfide-interchange reaction. A chemotherapeutic agent 
(such as Doxorubicin) is optionally contained within the liposome. SeeGabizon et al. . J. National Cancer Inst. . 
8i(19): 1484 (1989). 



25 



8. 



Immunoliposomes 
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9. Pharmaceutical Compositions of Antibodies 
Antibodies specifically binding a PRO polypeptide identifiedherein, as well as other molecules identified 
by the screening assays disclosed hereinbefore, can be administered for the treatment of various disorders in the 
form of pharmaceutical compositions. 

If the PRO polypeptide is intracellular and whole antibodies are used as inhibitors, internalizing 
5 antibodies are preferred. However, lipofections or liposomes can also be used to deliver the antibody, or an 
antibody fragment, into cells. Where antibody fragments are used, the smallest inhibitory fragment that 
specifically binds to the binding domain of the target protein is preferred. For example, based upon the variable- 
region sequences of an antibody, peptide molecules can be designed that retain the ability to bind the target 
protein sequence. Such peptides can be synthesized chemically and/or produced by recombinant DNA 

10 technology. See, e.g., Marasco et al., Proc. Natl. Acad. Sci. USA. 90: 7889-7893 (1993). The formulation 
herein may also contain more than one active compound as necessary for the particular indication being treated, 
preferably those with complementary activities that do not adversely affect each other. Alternatively, or in 
addition, the composition may comprise an agent that enhances its function, such as, for example, a cytotoxic 
agent, cytokine, chemotherapeutic agent, or growth-inhibitory agent. Such molecules are suitably present in 

15 combination in amounts that are effective for the purpose intended. 

The active ingredients may also be entrapped in microcapsules prepared, for example, by coacervation 
techniques or by interfacial polymerization, for example, hydroxy methylcellulose or gelatin-microcapsules and 
poly-(methylmethacylate) microcapsules, respectively, in colloidal drug delivery systems (for example, 
liposomes, albumin microspheres, microemulsions, nano-particles, and nanocapsules) or in macroemulsions. 

20 Such techniques are disclosed in Remington's Pharmaceutical Sciences , supra. 

The formulations to be used for in vivo administration must be sterile. This is readily accomplished by 
filtration through sterile filtration membranes. 

Sustained-release preparations may be prepared. Suitable examples of sustained-release preparations 
include semipermeable matrices of solid hydrophobic polymers containing the antibody, which matrices are in 

25 the form of shaped articles, e.g., films, or microcapsules. Examples of sustained-release matrices include 
polyesters, hydrogels (for example, poly(2-hydroxyethyl-methacrylate), or poly(vinylalcohol)), polylactides 
(U.S. Pat. No. 3,773,919), copolymers of L-glutamic acid and y ethyl-L-glutamate, non-degradable ethylene- 
vinyl acetate, degradable lactic acid-glycolic acid copolymers such as the LUPRON DEPOT ™ (injectable 
microspheres composed of lactic acid-glycolic acid copolymer and leuprolide acetate), and poly-D-(-)-3- 

30 hydroxybutyric acid. While polymers such as ethylene-vinyl acetate and lactic acid-glycolic acid enable release 
of molecules for over 100 days, certain hydrogels release proteins for shorter time periods. When encapsulated 
antibodies remain in the body for a long time, they may denature or aggregate as a result of exposure to moisture 
at 37°C, resulting in a loss of biological activity and possible changes in immunogenicity. Rational strategies 
can be devised for stabilization depending on the mechanism involved. For example, if the aggregation 

35 mechanism is discovered to be intermolecularS-S bond formation through thio-disulfide interchange, stabilization 
may be achieved by modifying sulfhydryl residues, lyophilizing from acidic solutions, controlling moisture 
content, using appropriate additives, and developing specific polymer matrix compositions. 
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G. Uses for anti-PRO Antibodies 

The anti-PRO antibodies of the invention have various utilities. For example, anti-PRO antibodies may 
be used in diagnostic assays for PRO, e.g., detecting its expression in specific cells, tissues, or serum. Various 
diagnostic assay techniques known in the art may be used, such as competitive binding assays, direct or indirect 
sandwich assays and immunoprecipitation assays conducted in either heterogeneous or homogeneous phases 
5 [Zola, Monoclonal Antibodies: A Manual of Techniques. CRC Press, Inc. (1987) pp. 147-158]. The antibodies 
used in the diagnostic assays can be labeled with a detectable moiety. The detectable moiety should be capable 
of producing, either directly or indirectly, a detectable signal. For example, the detectable moiety may be a 
radioisotope, such as 3 H, 14 C, 32 P, 35 S, or 125 I, a fluorescent or chemiluminescent compound, such as fluorescein 
isothiocyanate, rhodamine, or luciferin, or an enzyme, such as alkaline phosphatase, beta-galactosidase or 

10 horseradish peroxidase. Any method known in the art for conjugating the antibody to the detectable moiety may 
be employed, including those methods described by Hunter et al., Nature . 144:945 (1962); David et al. 9 
Biochemistry. 13:1014 (1974); Painet al., J. Immunol. Meth.. 40:219 (1981); and Nygren, J. Histochem. and 
Cvtochem.. 30:407 (1982). 

Anti-PRO antibodies also are useful for the affinity purification of PRO from recombinant cell culture 

15 or natural sources. In this process, the antibodies against PRO are immobilized on a suitable support, such a 
Sephadex resin or filter paper, using methods well known in the art. The immobilized antibody then is contacted 
with a sample containing the PRO to be purified, and thereafter the support is washed with a suitable solvent that 
will remove substantially all the material in the sample except the PRO, which is bound to the immobilized 
antibody. Finally, the support is washed with another suitable solvent that will release the PRO from the 

20 antibody. 

The following examples are offered for illustrative purposes only, and are not intended to limit the scope 
of the present invention in any way. 

All patent and literature references cited in the present specification are hereby incorporated by reference 
in their entirety. 

25 

EXAMPLES 

Commercially available reagents referred to in the examples were used according to manufacturer's 
instructions unless otherwise indicated. The source of those cells identified in the following examples, and 
throughout the specification, by ATCC accession numbers is the American Type Culture Collection, Manassas, 
30 VA. 

EXAMPLE 1 : Extracellular Domain Homology Screening to Identify Novel Polypeptides and cDNA Encoding 
Therefor 

The extracellular domain (ECD) sequences (including the secretion signal sequence, if any) from about 
35 950 known secreted proteins from the Swiss-Prot public database were used to search EST databases. The EST 
databases included public databases (e.g., Dayhoff, GenBank), and proprietary databases (e.g. LIFESEQ™, 
Incyte Pharmaceuticals, Palo Alto, CA). The search was performed using the computer program WU-BLAST-2 
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(Altschul et al., Methods in Enzvmolopv 266:460-480 (1996)) as a comparison of the ECD protein sequences 
to a 6 frame translation of the EST sequences. Those comparisons with a Blast score of 70 (or in some cases 
90) or greater that did not encode known proteins were clustered and assembled into consensus DNA sequences 
with the program "phrap" (Phil Green, University of Washington, Seattle, WA). 

Using this extracellular domain homology screen, consensus DNA sequences were assembled relative 
5 to the other identified EST sequences using phrap. In addition, the consensus DNA sequences obtained were 
often (but not always) extended using repeated cycles of WU-BLAST-2 and phrap to extend the consensus 
sequence as far as possible using the sources of EST sequences discussed above. 

Based upon the consensus sequences obtained as described above, oligonucleotides were then 
synthesized and used to identify by PCR a cDNA library that contained the sequence of interest and for use as 

10 probes to isolate a clone of the full-length coding sequence for a PRO polypeptide. Forward and reverse PCR 
primers generally range from 20 to 30 nucleotides and are often designed to give a PCR product of about 100- 
1000 bp in length. The probe sequences are typically 40-55 bp in length. In some cases, additional 
oligonucleotides are synthesized when the consensus sequence is greater than about 1 - 1 .5kbp. In order to screen 
several libraries for a full-length clone, DNA from the libraries was screened by PCR amplification, as per 

15 Ausubel et al. , Current Protocols in Molecular Biology, with the PCR primer pair. A positive library was then 
used to isolate clones encoding the gene of interest using the probe oligonucleotide and one of the primer pairs. 

The cDNA libraries used to isolate the cDNA clones were constructed by standard methods using 
commercially available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with 
oligo dT containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized 

20 appropriately by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as 
pRKB or pRKD; pRKSB is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al. , Science . 
253:1278-1280 (1991)) in the unique Xhol and NotI sites. 

EXAMPLE 2 : Isolation of cDNA clones by Amylase Screening 
25 1. Preparation of oligo dT primed cDNA library 

mRN A was isolated from a human tissue of interest using reagents and protocols from Invitrogen, San 

Diego, CA (Fast Track 2). This RNA was used to generate an oligo dT primed cDNA library in the vector 

pRK5D using reagents and protocols from Life Technologies, Gaithersburg, MD (Super Script Plasmid System). 

In this procedure, the double stranded cDNA was sized to greater than 1000 bp and the Sall/NotI linkered cDNA 
30 was cloned into XhoI/NotI cleaved vector. pRK5D is a cloning vector that has an sp6 transcription initiation 

site followed by an Sfil restriction enzyme site preceding the XhoI/NotI cDNA cloning sites. 

2. Preparation of random primed cDNA library 

A secondary cDNA library was generated in order to preferentially represent the 5' ends of the primary 
35 cDNA clones. Sp6 RNA was generated from the primary library (described above), and this RNA was used to 
generate a random primed cDNA library in the vector pSST-AMY.O using reagents and protocols from Life 
Technologies (Super Script Plasmid System, referenced above). In this procedure the double stranded cDNA 
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was sized to 500-1000 bp, linkered with blunt to Notl adaptors, cleaved with Sfil, and cloned into Sfil/NotI 
cleaved vector. pSST-AMY.O is a cloning vector that has a yeast alcohol dehydrogenase promoter preceding 
the cDNA cloning sites and the mouse amylase sequence (the mature sequence without the secretion signal) 
followed by the yeast alcohol dehydrogenase terminator, after the cloning sites. Thus, cDNAs cloned into this 
vector that are fused in frame with amylase sequence will lead to the secretion of amylase from appropriately 
5 transfected yeast colonies. 

3. Transformation and Detection 

DNA from the library described in paragraph 2 above was chilled on ice to which was added 
electrocompetent DH10B bacteria (Life Technologies, 20 ml). The bacteria and vector mixture was then 

10 electroporated as recommended by the manufacturer. Subsequently, SOC media (Life Technologies, 1 ml) was 
added and the mixture was incubated at 37°C for 30 minutes. The transformants were then plated onto 20 
standard 150 mm LB plates containing ampicillin and incubated for 16 hours (37°C). Positive colonies were 
scraped off the plates and the DNA was isolated from the bacterial pellet using standard protocols, e.g. CsCl- 
gradient. The purified DNA was then carried on to the yeast protocols below. 

15 The yeast methods were divided into three categories: (1) Transformation of yeast with the 

plasmid/cDNA combined vector; (2) Detection and isolation of yeast clones secreting amylase; and (3) PCR 
amplification of the insert directly from the yeast colony and purification of the DNA for sequencing and further 
analysis. 

The yeast strain used was HD56-5A (ATCC-90785). This strain has the following genotype: MAT 
20 alpha, ura3-52, leu2-3, leu2-112, his3-ll, his3-15, MAL + , SUC + , GAL\ Preferably, yeast mutants can be 
employed mat have deficient post-translational pathways. Such mutants may have translocation deficient alleles 
in secll, secll, sec62, with truncated secll being most preferred. Alternatively, antagonists (including 
antisense nucleotides and/or ligands) which interfere with the normal operation of these genes, other proteins 
implicated in this post translation pathway (e.g., SEC61p, SEC72p, SEC62p, SEC63p, TDJlp or SSAlp-4p) 
25 or the complex formation of these proteins may also be preferably employed in combination with the amylase- 
expressing yeast. 

Transformation was performed based on the protocol outlined by Gietz et al . , Nucl. Acid. Res.. 20 : 1425 

(1992). Transformed cells were then inoculated from agar into YEPD complex media broth (100 ml) and grown 

overnight at 30°C. The YEPD broth was prepared as described in Kaiser et al. , Methods in Yeast Genetics . 
30 Cold Spring Harbor Press, Cold Spring Harbor, NY, p. 207 (1994). The overnight culture was then diluted to 

about 2 x 10 6 cells/ml (approx. 00600=0.1) into fresh YEPD broth (500 ml) and regrown to 1 x 10 7 cells/ml 

(approx. 00600=0.4-0.5). 

The cells were then harvested and prepared for transformation by transfer into GS3 rotor bottles in a 

Sorval GS3 rotor at 5,000 rpm for 5 minutes, the supernatant discarded, and then resuspended into sterile water, 
35 and centriruged again in 50 ml falcon tubes at 3,500 rpm in a Beckman GS-6KR centrifuge. The supernatant 

was discarded and the cells were subsequently washed with LiAc/TE (10 ml, 10 mM Tris-HCl, 1 mM EDTA 

pH 7.5, 100 mM Li 2 OOCCH 3 ), and resuspended into LiAc/TE (2.5 ml). 

375 



WO 99/63088 



PCT/US99/12252 



Transformation took place by mixing the prepared cells (100 /tl) with freshly denatured single stranded 
salmon testes DNA (Lof strand Labs, Gaithersburg, MD) and transforming DNA (1 pg, vol. < 10 /d) in 
microfuge tubes. The mixture was mixed briefly by vortexing, then 40% PEG/TE (600 j*l, 40% polyethylene 
glycol-4000, 10 mM Tris-HCl, 1 mM EDTA, 100 mM Li 2 OOCCH 3 , pH 7.5) was added. This mixture was 
gently mixed and incubated at 30°C while agitating for 30 minutes. The cells were then heat shocked at 42°C 
5 for 15 minutes, and the reaction vessel centrifuged in a microfuge at 12,000 rpm for 5-10 seconds, decanted and 
resuspended into TE (500 j*l, 10 mM Tris-HCl, 1 mM EDTA pH 7.5) followed by recentrirugation. The cells 
were then diluted into TE (1 ml) and aliquots (200 /d) were spread onto the selective media previously prepared 
in 150 mm growth plates (VWR). 

Alternatively, instead of multiple small reactions, the transformation was performed using a single, large 
10 scale reaction, wherein reagent amounts were scaled up accordingly. 

The selective media used was a synthetic complete dextrose agar lacking uracil (SCD-Ura) prepared as 
described in Kaiser et al., Methods in Yeast Genetics. Cold Spring Harbor Press, Cold Spring Harbor, NY, p. 
208-210 (1994). Transformants were grown at 30°C for 2-3 days. 

The detection of colonies secreting amylase was performed by including red starch in the selective 
15 growth media. Starch was coupled to the red dye (Reactive Red- 120, Sigma) as per the procedure described by 
Biely et al., Anal. Biochem.. 172:176-179 (1988). The coupled starch was incorporated into the SCD-Ura agar 
plates at a final concentration of 0.15% (w/v), and was buffered with potassium phosphate to a pH of 7.0 (50- 
100 mM final concentration). 

The positive colonies were picked and streaked across fresh selective media (onto 150 mm plates) in 
20 order to obtain well isolated and identifiable single colonies. Well isolated single colonies positive for amylase 
secretion were detected by direct incorporation of red starch into buffered SCD-Ura agar. Positive colonies were 
determined by their ability to break down starch resulting in a clear halo around the positive colony visualized 
directly. 

25 4. Isolation of DNA bv PCR Amplification 

When a positive colony was isolated, a portion of it was picked by a toothpick and diluted into sterile 
water (30 /il) in a 96 well plate. At this time, the positive colonies were either frozen and stored for subsequent 
analysis or immediately amplified. An aliquot of cells (5 pi) was used as a template for the PCR reaction in a 
25 fi\ volume containing: 0.5 fi\ Klentaq (Clontech, Palo Alto, CA); 4.0 /il 10 mM dNTP's (Perkin Elmer- 
30 Cetus); 2.5 /il Kentaq buffer (Clontech); 0.25 /il forward oligo 1 ; 0.25 /tl reverse oligo 2; 12.5 /xl distilled water. 
The sequence of the forward oligonucleotide 1 was: 

5 ' -TGTAAA ACGACGGCC AGT TAAAT AGACCTGC AATTATT A ATCT- 3 ' (SEQ ID NO:3) 
The sequence of reverse oligonucleotide 2 was: 

5 ' -C AGG AAAC AGCT ATG ACC ACCTGC AC ACCTGC A A ATCC ATT- 3 ' (SEQ ID NO:4) 
35 PCR was then performed as follows: 

a. Denature 92°C, 5 minutes 

b. 3 cycles of: Denature 92°C, 30 seconds 
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Anneal 
Extend 



59° C, 30 seconds 
72°C, 60 seconds 



c. 



3 cycles of: 



Denature 

Anneal 

Extend 



92°C, 30 seconds 
57°C, 30 seconds 
72°C, 60 seconds 



5 



25 cycles of: 



Denature 

Anneal 

Extend 



92°C, 30 seconds 
55 °C, 30 seconds 
72 °C, 60 seconds 



10 



e. 



Hold 



4°C 



The underlined regions of the oligonucleotides annealed to the ADH promoter region and the amylase 
region, respectively, and amplified a 307 bp region from vector pSST-AMY.O when no insert was present. 
15 Typically, the first 18 nucleotides of the 5* end of these oligonucleotides contained annealing sites for the 
sequencing primers. Thus, the total product of the PCR reaction from an empty vector was 343 bp. However, 
signal sequence-fused cDNA resulted in considerably longer nucleotide sequences. 

Following the PCR, an aliquot of the reaction (5 pi) was examined by agarose gel electrophoresis in 
a 1 % agarose gel using a Tris-Borate-EDTA (TBE) buffering system as described by Sambrook et al., s upra. 
20 Clones resulting in a single strong PCR product larger than 400 bp were further analyzed by DNA sequencing 
after purification with a 96 Qiaquick PCR clean-up column (Qiagen Inc., Chats worth, CA). 

EXAMPLE 3 : Isolation of cDNA Clones Using Signal Algorithm Analysis 



25 sequence finding algorithm developed by Genentech, Inc. (South San Francisco, CA) upon ESTs as well as 
clustered and assembled EST fragments from public (e.g., GenBank) and/or private (LIFESEQ®, Incyte 
Pharmaceuticals, Inc., Palo Alto, CA) databases. The signal sequence algorithm computes a secretion signal 
score based on the character of the DNA nucleotides surrounding the first and optionally the second methionine 
codon(s) (ATG) at the 5'-end of the sequence or sequence fragment under consideration. The nucleotides 

30 following the first ATG must code for at least 35 unambiguous amino acids without any stop codons. If the first 
ATG has the required amino acids, the second is not examined. If neither meets the requirement, the candidate 
sequence is not scored. In order to determine whether the EST sequence contains an authentic signal sequence, 
the DNA and corresponding amino acid sequences surrounding the ATG codon are scored using a set of seven 
sensors (evaluation parameters) known to be associated with secretion signals. Use of this algorithm resulted 

35 in the identification of numerous polypeptide-encoding nucleic acid sequences. 

EXAMPLE 4 : Isolation of cDNA clones Encoding Human PRQ281 

In order to obtain a cDNA clone encoding PR0281, methods described in Klein et al., Proc. Natl. 
Acad. Sci. USA 93:7108-71 13 (1996) were employed with the following modifications. Yeast transformation 
40 was performed with limiting amounts of transforming DNA in order to reduce the number of multiple 
transformed yeast cells. Instead of plasmid isolation from the yeast followed by transformation of £. coli as 



Various polypeptide-encoding nucleic acid sequences were identified by applying a proprietary signal 



377 



WO 99/63088 



PCT/US99/12252 



described in Klein et al., supra. PGR analysis was performed on single yeast colonies. PCR primers employed 
were bipartite in order to amplify the insert and a small portion of the invertase gene (allowing to determine that 
the insert was in frame with invertase) and to add on universal sequencing primer sites. 

An invertase library was transformed into yeast and positives were selected on sucrose plates. Positive 
clones were re-tested and PCR products were sequenced. The sequence of one clone, PR0281 , was determined 
5 to contain a signal peptide coding sequence. Oligonucleotide primers and probes were designed using the 
nucleotide sequence of PR0281. A full length plasmid library of cDNAs from human umbilical vein 
endothelium tissue was titered and approximately 100,000 cfu were plated in 192 pools of 500 cru/pool into 96- 
well round bottom plates. The plates were sealed and pools were grown overnight at 37°C with shaking 
(200rpm). PCR was performed on the individual cultures using primers. Agarose gel electrophoresis was 

10 performed and positive wells were identified by visualization of a band of the expected size. Individual positive 
clones were obtained by colony lift followed by hybridization with 32 P-labeled oligonucleotide. These clones 
were characterized by PCR, restriction digest, and southern blot analyses. 

A full length clone was identified that contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 80-82, and a stop signal at nucleotide positions 1115-1117 

15 (Figure 1, SEQ ID NO:l). The predicted polypeptide precursor is 345 amino acids long, has a calculated 
molecular weight of approximately 37,205 daltons and an estimated pi of approximately 10. 15. Analysis of the 
full-length PR0281 sequence shown in Figure 2 (SEQ ID NO:2) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 14, multiple transmembrane domains from about 
amino acid position 83 to about amino acid position 105, from about amino acid position 126 to about amino acid 

20 position 146, from about amino acid position 158 to about amino acid position 177, from about amino acid 
position 197 to about amino acid position 216, from about amino acid position 218 to about amino acid position 
238, from about amino acid position 245 to about amino acid position 265, and from about amino acid position 
271 to about amino acid position 290 and an amino acid sequence block having homology to G-protein coupled 
receptor proteins from about amino acid 1 15 to about amino acid 155. Clone UNQ244 (DNA 16422- 1209) has 

25 been deposited with ATCC on June 2, 1998 and is assigned ATCC deposit no. 209929. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 2 (SEQ ID NO:2), evidenced significant 
homology between the PR0281 amino acid sequence and the following Dayhoff sequences: H64634, 
AF033095J, B64815, YBHLJECOLI, EMEQUTRJ, AF064763_3, S53708, A69253, AF035413J2 and 

30 S63281. 

EXAMPLE 5 : Isolation of cDNA clones Encoding Human PRQ276 

In order to obtain a cDNA clone encoding PR0276, methods described in Klein etal., PNAS . 93:7108- 
7113 (1996) were employed with the following modifications. Yeast transformation was performed with limiting 
35 amounts of transforming DNA in order to reduce the number of multiple transformed yeast cells. Instead of 
plasmid isolation from the yeast followed by transformation of E. coli as described in Klein et al., supra . PCR 
analysis was performed on single yeast colonies. PCR primers employed were bipartite in order to amplify the 
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insert and a small portion of the invertase gene (allowing to determine that the insert was in frame with invertase) 
and to add on universal sequencing primer sites. 

An invertase library was transformed into yeast and positives were selected on sucrose plates. Positive 
clones were re-tested and PCR products were sequenced. The sequence of one clone, PR0276, was determined 
to contain a signal peptide coding sequence. Oligonucleotide primers and probes were designed using the 
5 nucleotide sequence of PR0276. A full length plasmid library of cDN As from human fetal liver cells was titered 
and approximately 100,000 cfu were plated in 192 pools of 500 cfu/pool into 96-well round bottom plates. The 
plates were sealed and pools were grown overnight at 37 C with shaking (200rpm). PCR was performed on the 
individual cultures using primers. Agarose gel electrophoresis was performed and positive wells were identified 
by visualization of a band of the expected size. Individual positive clones were obtained by colony lift followed 
* 10 by hybridization with 32 P-labeled oligonucleotide. These clones were characterized by PCR, restriction digest, 
and southern blot analyses. 

A full length clone was identified that contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 180-182 and a stop signal at nucleotide positions 933-935 
(Figure 3; SEQ ID NO:5). The predicted polypeptide precursor is 251 amino acids long has a calculated 
15 molecular weight of approximately 28,801 daltons and an estimated pi of approximately 9.58. The 
transmembrane domains are approximately at amino acids 98- 1 1 6 and 1 52- 1 72 of the sequence shown in Figure 
4 (SEQ ID NO:6). Clone DNA16435-1208 (UNQ243) has been deposited with the ATCC and is assigned 
ATCC deposit no. 209930 . 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
20 alignment analysis of the full-length sequence shown in Figure 4 (SEQ ID NO:6), revealed some sequence 
identity between the PR0276 amino acid sequence and the following Dayhoff sequences: CEG25D7 2, 
ATT805 2, S69696, GRHR RAT, NPCBAABCDJ, AB013149J, P_R85942 and AP000006_5. 

EXAMPLE 6 : Isolation of cDNA clones Encoding Human PRO 189 

25 A clone designated herein as DNA14187 was isolated as described in Example 2 above from a human 

retina tissue library. The DNA14187 sequence is shown in Figure 7 (SEQ ID NO:9). Based on the DNA14187 
sequence shown in Figure 7 (SEQ ID NO:9), oligonucleotides were synthesized: 1) to identify by PCR a cDNA 
library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding 
sequence for PR0189. Forward and reverse PCR primers generally range from 20 to 30 nucleotides and are 

30 often designed to give a PCR product of about 100-1000 bp in length. The probe sequences are typically 40-55 
bp in length. In order to screen several libraries for a full-length clone, DNA from the libraries was screened 
by PCR amplification, as per Ausubel et al., Current Protocols in Molecular Biology, with the PCR primer pair. 
A positive library was then used to isolate clones encoding the gene of interest using the probe oligonucleotide 
and one of the primer pairs. 

35 A pair of PCR primers (forward and reverse) were synthesized: 

forward PCR primer 5 ' -TTG ACCTAT AC AGAG ATTC ATC-3 ' (SEQ ID NO: 10); and 
reverse PCR primer 5 , -CTAAGAACTTCCCTCAGGATTTT-3' (SEQ ID NO: 1 1). 

379 



WO 99/63088 



PCT/US99/12252 



Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA14187 sequence 
which had the following nucleotide sequence: 
hybridization probe 

5 , -ATGAAGATCAATTTCAAGAAGCATGCACTTCTCCTCTTGC-3' (SEQ ID NO: 12). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
5 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0189 gene using the probe oligonucleotide and one of the PCR primers. 

RNA for construction of the cDNA libraries was isolated from human retina tissue (LIB94). The cDNA 
libraries used to isolate the cDNA clones were constructed by standard methods using commercially available 
reagents such as those from Invitrogen, San Diego, CA, The cDNA was primed with oligo dT containing a NotI 
10 site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately by gel 
electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or pRKD; 
pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes etal., Science . 253:1278-1280 
(1991)) in the unique Xhol and NotI sites. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
15 PR0189 and the derived protein sequence for PR0189. 

The entire nucleotide sequence of DNA21624-1391 is shown in Figure 5 (SEQ ID N0:7). Clone 
DNA2 1624-1391 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 200-202 and ending at the stop codon at nucleotide positions 1301-1303 (Figure 5). The predicted 
polypeptide precursor is 367 amino acids long (Figure 6). The full-length PR0189 protein shown in Figure 6 
20 has an estimated molecular weight of about 41,871 daltons and a pi of about 5.06. Clone DNA21624-1391 has 
been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains the 
correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Analyzing the amino acid sequence of SEQ ID NO:8, the putative N-glycosylation sites are at about 
amino acids 224-227, 246-249 and 285-288. A domain for cytosolic fatty-acid binding proteins is at amino acids 
25 78-107 of SEQ ID NO:8. The corresponding nucleotides can be routinely determined given the sequences 
provided herein. 

Some sequence identity was found to W01A6.1 and F35D1 1.1 1, C. Elegans proteins, designated in a 
Dayhoff database as CEW01A610 and CELF35D1 1_1 1 , respectively. Some sequence identity was also found 
to an antigen to malaria and to restin, designated in a Dayhoff database as PR05766 and AF014012_1, 
30 respectively. Some sequence identity was also found to a microtubule binding protein and to myosin, designated 
in a Dayhoff database as AF041382 1 and S07537, respectively. There is also some sequence identity with 1- 
phosphatidyIinositol-4, 5-bisphosphate, designated as PIP1RAT. 

EXAMPLE 7: Isolation of cDNA clones Encoding Hum an PRO190 
35 A clone designated herein as DNA14232 was isolated as described in Example 2 above from a human 

fetal retina tissue library. The DNA14232 sequence is shown in Figure 10 (SEQ ID NO: 15). Based on the 
DNA14232 sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
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the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PRO190. Forward and reverse PCR primers generally range from 20 to 30 nucleotides and are often designed 
to give a PCR product of about 100-1000 bp in length. The probe sequences are typically 40-55 bp in length. 
In order to screen several libraries for a full-length clone, DNA from the libraries was screened by PCR 
amplification, as per Ausubel et al., Current Protocols in Molecular Biology, with the PCR primer pair. A 
5 positive library was then used to isolate clones encoding the gene of interest using the probe oligonucleotide and 
one of the primer pairs. 

A pair of PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5 ' -CTATACCTACTGTAGCTTCT-3 * (SEQ ID NO: 16); and 
reverse PCR primer 5 ' -TC AG AGA ATTCCTTC C AGG A-3 ' (SEQ ID NO: 17). 
10 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA 14232 sequence 
which had the following nucleotide sequence: 
hybridization probe 

S'-ACAGTGCTGTAGTCATCCTGTAATATGCTCCTTGTCAACA^' (SEQ ID NO: 18). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
15 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO190 gene using the probe oligonucleotide and one of the PCR primers. 

RNA for construction of the cDNA libraries was isolated from human retina tissue (LIB94). The cDNA 
libraries used to isolate the cDNA clones were constructed by standard methods using commercially available 
reagents such as those from lnvitrogen, San Diego, CA. The cDN A was grimed with oligo dT containing a NotI 
20 site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately by gel 
electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or pRKD; 
pRK5B is a precursor of pRKSD that does not contain the Sfil site; see, Holmes et al. , Science , 253:1278-1280 
(1991)) in the unique Xhol and NotI sites. 

DNA sequencing of the clones isolated as described above gave sequences which include the full-length 
25 DNA sequence for PRO190 [herein designated as DNA23334-1392] (SEQ ID NO: 13) and the derived protein 
sequence for PRO190. 

The entire nucleotide sequence of DNA23334-1392 is shown in Figure 8 (SEQ ID NO: 13). Clone 
DNA23334-1392 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 193- 195 and which ends at the stop codon at nucleotide positions 1465-1467 (Figure 8). The predicted 
30 polypeptide precursor is 424 amino acids long (Figure 9). The full-length PRO 190 protein shown in Figure 9 
has an estimated molecular weight of about 48,500 daltons and a pi of about 8.65. Clone DNA23334-1392 has 
been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains the 
correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Analyzing the amino acid sequence of SEQ ID NO: 14, the putative transmembrane domains are at about 
35 amino acids 16-36, 50-74, 147-168, 229-250, 271-293, 298-318 and 328-368 of SEQ ID NO: 14. N- 
glycosylation sites are at about amino acids 128-131, 204-207, 218-221 and 274-377 of SEQ ID NO: 14. The 
corresponding nucleotides can be routinely determined given the sequences provided herein. 
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PRO 190 has sequence identity with at least the following Dayhoff sequences designated as: 
CEZK896_2, JC5023, GMS1_SCHP0 and S44668. 

EXAMPLE 8 : Isolation of cDNA clones Encoding Human PRQ341 

A clone designated herein as DNA 12920 was isolated as described in Example 2 above from a human 
5 placenta tissue library. The DNA12920 sequence is shown in Figure 13 (SEQ ID NO:21). The DNA12920 
sequence was then compared to various EST databases including public EST databases (e.g., GenBank), and a 
proprietary EST database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify homologous EST 
sequences. The comparison was performed using the computer program BLAST or BLAST2 [Altschul et al., 
Methods in Enzvmologv. 266:460-480 (1996)]. Those comparisons resulting in a BLAST score of 70 (or in 

10 some cases, 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). This 
consensus sequence is herein designated DNA25314. Oligonucleotide primers based upon the DNA25314 
sequence were then synthesized and employed to screen a human placenta cDNA library which resulted in the 
identification of the DNA26288-1239 clone shown in Figure 1 1 . The cloning vector was pRK5B (pRK5B is a 

15 precursor of pRK5D that does not contain the Sfil site; see, Holmes et al., Science . 253: 1278-1280 (1991)), and 
the cDNA size cut was less than 2800 bp. 

A full length clone was identified that contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 380-382, and a stop signal at nucleotide positions 1754-1756 
(Figure 11, SEQ ID NO: 19). The predicted polypeptide precursor is 458 amino acids long, has a calculated 

20 molecular weight of approximately 50,264 daltons and an estimated pi of approximately 8. 17. Analysis of the 
full-length PR0341 sequence shown in Figure 12 (SEQ ID NO:20) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 17, transmembrane domains from about amino acid 
171 to about amino acid 190, from about amino acid 220 to about amino acid 239, from about amino acid 259 
to about amino acid 275, from about amino acid 286 to about amino acid 305, from about amino acid 316 to 

25 about amino acid 335, from about amino acid 353 to about amino acid 378 and from about amino acid 396 to 
about amino acid 417 and potential N-glycosylation sites from about amino acid 145 to about amino acid 147 
and from about amino acid 155 to about amino acid 158. Clone DNA26288-1239 has been deposited with 
ATCC on April 21, 1998 and is assigned ATCC deposit no. 209792. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 

30 alignment analysis of the full-length sequence shown in Figure 12 (SEQ ID NO:20), evidenced homology 
between the PR0341 amino acid sequence and the following Dayhoff sequences: S75696, H69788, D69852, 
A69888, B64918, F64752, LPU89276J, G64962, S52977 and S44253. 

EXAMPLE 9 : Isolation of cDNA clones Encoding Human PRO 180 
35 A clone designated herein as DNA 12922 was isolated as described in Example 2 above from a human 

placenta tissue library. The DNA12922 sequence is shown in Figure 16 (SEQ ID NO:24). The DNA12922 
sequence was then compared to various EST databases including public EST databases (e.g., GenBank), and a 
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proprietary EST database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify homologous EST 
sequences. The comparison was performed using the computer program BLAST or BLAST2 [Altschul et ah, 
Methods in Enzvmology. 266:460-480 (1996)]. Those comparisons resulting in a BLAST score of 70 (or in 
some cases, 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). 
5 An oligonucleotide probe was formed based upon the consensus sequence obtained above. This probe 

had the following sequence. 

5 • -ACCTGTT AG AA ATGTGGTGGTTTCAGC AAGGCCTC AGTTT (SEQ ID NO:25). 

This probe was used to screen a human placenta library prepared as described in paragraph 1 of Example 2 

above. The cloning vector was pRK5B (pRK5B is a precursor of pRK5D that does not contain the SMI site; see, 
10 Holmes et al., Science . 253:1278-1280 (1991)), and the cDNA size cut was less than 2800 bp. A clone 

designated herein as DNA26843-1389 was obtained. 

The entire nucleotide sequence of DNA26843-1389 is shown in Figure 14 (SEQ ID NO:22). Clone 

DNA26843-1389 contains a single open reading frame with an apparent translational initiation site at nucleotide 

positions 121-123 and ending at the stop codon at nucleotide positions 919-921 (Figure 14). The predicted 
15 polypeptide precursor is 266 amino acids long (Figure 15). The full-length PRO 180 protein shown in Figure 

15 has an estimated molecular weight of about 29,766 daltons and a pi of about 8.39. Clone DNA26843-1389 

has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 

the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Still analyzing the amino acid sequence of SEQ ID NO:23, the transmembrane domains are at about 
20 amino acids 13-33 (type II), 54-73, 94-113, 160-180 and 122-141 of SEQ ID NO:23. N-myristoylation sites 

are at about amino acids 57-62, 95-100, 99-104, 124-129 and 183-188 of SEQ ID NO:23. The corresponding 

nucleotides can be routinely determined given the sequences provided herein. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 15 (SEQ ID NO:23), evidenced some sequence 
25 identity between the PRO180 amino acid sequence and the following Dayhoff sequences: CEC33A112, 

CEG11E6J, CELW03A5_1 AND PEU83861_2 (NADH dehydrogenase subunit 4L, mitochondrion). 

EXAMPLE 10 : Isolation of cDNA clones Encoding Human PRO 194 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 

30 in Example 1 above. This consensus sequence is herein DNA19464. Based on the DNA19464 consensus 
sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence 
of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0194. PCR 
primers (forward and reverse) were synthesized based upon the DNA19464 sequence. Additionally, a synthetic 
oligonucleotide hybridization probe was constructed from the consensus DNA19464 sequence. 

35 in order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO 194 gene using the probe oligonucleotide and one of the PCR primers. RNA 
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for construction of the cDNA libraries was isolated from human fetal lung tissue (LIB25). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR0194 [herein designated as DNA26844-1394] (SEQ ID NO:27) and the derived protein sequence for 
PR0194. 

The entire nucleotide sequence of DNA26844-1394 is shown in Figure 17 (SEQ ID NO:27). Clone 
5 DNA26844-1394 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 81-83 and ending at the stop codon at nucleotide positions 873-875 (Figure 17). The predicted 
polypeptide precursor is 264 amino acids long (Figure 18). The full-length PR0194 protein shown in Figure 
18 has an estimated molecular weight of about 29,665 daltons and a pi of about 9.34. Analysis of the full-length 
PR0194 sequence shown in Figure 18 (SEQ ID NO:28) evidences the presence of various important 
10 polypeptides domains as shown in Figure 18. Clone DNA26844-1394 has been deposited with ATCC on June 
2, 1998 and is assigned ATCC deposit no. 209926. 

Analysis of the amino acid sequence of the full-length PRO 194 polypeptide suggests that it does not 
exhibit significant sequence similarity to any known human protein. However, an analysis of the Dayhoff 
database (version 35.45 SwissProt 35) evidenced some homology between the PR0194 amino acid sequence and 
15 the following Dayhoff sequences, HUMORFT1, CET07F10_5 t ATFCA9_12, F64934, YDJXJECOLI, 
ATAF00065719F29G20.19, H70002, S76980, H64934 and S76385. 

EXAMPLE 11: Isolation of cDNA clones Encoding Human PRQ2Q3 

A clone designated herein as DNA15618 was isolated as described in Example 2 above from a human 

20 fetal lung tissue library. The DNA15618 sequence is shown in Figure 21 (SEQ ID NO:31). Oligonucleotide 
probes were generated from the sequence of the DNA 156 18 molecule and were used to screen a human fetal lung 
library (LIB26) prepared as described in paragraph 1 of Example 2 above. The cloning vector was pRK5B 
(pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al. , Science . 253 : 1278-1280 
(1991)), and the cDNA size cut was less than 2800 bp. 

25 A full length clone was identified that contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 159-161 and ending at the stop codon found at nucleotide 
positions 1200-1202 (Figure 19; SEQ ID NO:29). The predicted polypeptide precursor is 347 amino acids long, 
has a calculated molecular weight of approximately 39,870 daltons and an estimated pi of approximately 6.76. 
Analysis of the full-length PRO203 sequence shown in Figure 20 (SEQ ID NO: 30) evidences the presence of 

30 the following: a type II transmembrane domain at about amino acid 64 to about amino acid 87; possible N- 
glycosylation sites at about amino acid 147 to about amino acid 150, about amino acid 155 to about amino acid 
158, and about amino acid 237 to about amino acid 240; sequence identity with heavy-metal-associated domain 
proteins at about amino acid 23 to about amino acid 45, and sequence identity with D-isomer specific 2- 
hydroxyacid dehydrogenase at about amino acid 24 to about amino acid 34. Clone DNA30862-1396 was 

35 deposited with the ATCC on June 2, 1998, and is assigned ATCC deposit no. 209920. 

Analysis of the amino acid sequence of the full-length PRO203 polypeptide suggests that it possesses 
sequence similarity to GST ATPase, thereby indicating that PRO203 may be a novel GST ATPase. More 
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specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced homology between the 
PRO203 amino acid sequence and the following Dayhoff sequences, AF008124_1, CFRCD1GENJ, and 
P_R82566. 

EXAMPLE 12 : Isolation of cDNA clones Encoding Human PRO290 
5 An expressed sequence tag (EST) DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) 

was searched and an EST was identified that had homology to beige and FAN. An oligonucleotide probe based 
upon the identified EST sequence was then synthesized and used to screen human fetal kidney cDNA libraries 
in an attempt to identify a full-length cDNA clone. The oligonucleotide probe had the following sequence: 
5' TGACTGCACTACCCCGTGGCAAGCTGTTGAGCCAGCTCAGCTG 3* (SEQ ID NO:34). 

10 RNA for construction of cDNA libraries was isolated from human fetal kidney tissue. The cDNA 

libraries used to isolate the cDNA clones encoding human PRO290 were constructed by standard methods using 
commercially available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with 
oligo dT containing a NotI site, linked with blunt to Sail hemikinascd adaptors, cleaved with NotI, sized 
appropriately by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as 

15 pRKB or pRKD; pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al. , Science 
253:1278-1280 (1991)) in the unique Xhol and NotI. 

A cDNA clone was identified and sequenced in entirety. The entire nucleotide sequence of DNA35680- 
1212 is shown in Figure 22 (SEQ ID NO:32). Clone DNA35680- 12 12 contains a single open reading frame with 
an apparent translational initiation site at nucleotide positions 293-295, and a stop codon at nucleotide positions 

20 3302-3304 (Figure 22; SEQ ID NO:32). The predicted polypeptide precursor is 1003 amino acids long. 

It is currently believed that the PRO290 polypeptide is related to FAN and/or beige. Clone DNA35680- 
1212 has been deposited with ATCC and is assigned ATCC deposit no. 209790. It is understood that the 
deposited clone has the actual correct sequence rather than the representations provided herein. The full-length 
PRO290 protein shown in Figure 23 has an estimated molecular weight of about 1 12,013 daltons and a pi of 

25 about 6.4. 

EXAMPLE 13 : Isolation of cDNA Clones Encoding Human PRQ874 

A consensus DNA sequence designated herein as DNA36459 was identified using phrap as described 

in Example 1 above. Based on the DNA36459 consensus sequence, oligonucleotides were synthesized: 1) to 
30 identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a 

clone of the coding sequence for PR0874. 

PCR primers (forward and reverse) were synthesized: 

forward PCR primer 5 '-TCGTGCCCAGGGGCTGATGTGC-3' (SEQ ID NO:37); and 

reverse PCR primer S^GTCTTTACCCAGCCCCGGGATGCGO' (SEQ ID NO:38). 
35 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA36459 

sequence which had the following nucleotide sequence: 
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hybridization probe 

S'-GGCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGG-S' (SEQ ID NO:39). 

In order to screen several libraries for a source of a clone, DNA from the libraries was screened by 
PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones 
encoding the PR0874 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction 
5 of the cDNA libraries was isolated from human fetal lung tissue (UB25). 

DNA sequencing of the clones isolated as described above gave the DNA sequence for PR0874 [herein 
designated as DNA4062 1-1440] (SEQ ID NO:35) and the derived protein sequence for PR0874. 

The entire nucleotide sequence of DNA40621-1440 is shown in Figure 24 (SEQ ID NO:35). Clone 
DNA40621-1440 contains a single open reading frame ending at the stop codon at nucleotide positions 964-966 
10 (Figure 24). The predicted polypeptide encoded by DNA40621-1440 is 321 amino acids long (Figure 25). The 
PR0874 protein shown in Figure 25 has an estimated molecular weight of about 36, 194 daltons and a pi of about 
9.85. Analysis of the PR0874 sequence shown in Figure 25 (SEQ ID NO:36) evidenced the presence of the 
following: a type II transmembrane domain at about amino acids 57-80; additional transmembrane domains at 
about amino acids 1 10-126, 215-231, and 254-274; potential N-glycosylation sites at about amino acids 16-19, 
15 27-30, and 289-292; sequence identity with hypothetical YBR002c family proteins at about amino acids 276-287; 
and sequence identity with ammonium transporter proteins at about amino acids 204-230. Clone DNA40621- 
1440 was deposited with the ATCC on June 2, 1998, and is assigned ATCC deposit no. 209922. 

Analysis of the amino acid sequence of the PR0874 polypeptide suggests that it is a novel multi-span 
transmembrane protein. However, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced 
20 sequence identity between the PR0874 amino acid sequence and the following Dayhoff sequences: S67049, 
AF054839J, S73437, S52460, and HIVU80570J. 

EXAMPLE 14 : Isolation of cDNA Clones Encoding Human PRO710 

A yeast screening assay was employed to identify cDNA clones that encoded potential secreted proteins. 
25 Use of this yeast screening assay allowed identification of a single cDNA clone whose sequence (herein 

designated as DNA38190) is shown in Figure 28 (SEQ ID NO:42). Based on the DNA38190 sequence shown 

in Figure 28, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the 

sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO710. 

In order to screen several libraries for a full-length clone, DNA from the libraries was screened by PCR 
30 amplification, as per Ausubel et al., Current Protocols in Molecular Biolopv. with the PCR primer pair. A 

positive library was then used to isolate clones encoding the gene of interest using the probe oligonucleotide and 

one of the primer pairs. 

PCR primers (forward and reverse) were synthesized: 

forward PCR primer 5 '-TTCCGCAAAGAGTTCTACG AGGTGG-3 ' (SEQ ID NO:43) 
35 reverse PCR primer S'-ATTGACAACATTGACTGGCCTATGGG-S* (SEQ ID NO:44) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA38190 sequence 

which had the following nucleotide sequence 
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hybridization probe 

5 ' -GTGG ATGCTCTGTGTGCGTGC AAG ATCCTTC AGGCCTTGTTCC AGTGTGA-3 ' (SEQ ID NO:45) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PGR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO710 gene using the probe oligonucleotide and one of the PCR primers. 
5 RNA for construction of the cDNA libraries was isolated from human fetal kidney tissue (LIB227). 

The cDNA libraries used to isolate the cDNA clones were constructed by standard methods using commercially 
available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT 
containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately 
by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or 

10 pRKD; pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al., Science . 
253:1278-1280 (1991)) in the unique Xhol and NotI sites. 

A full length clone was identified that contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 67-69 and ending at the stop codon found at nucleotide positions 
1765-1767 (Figure 26, SEQ ID NO:40). The predicted polypeptide precursor is 566 amino acids long, has a 

15 calculated molecular weight of approximately 65,555 daltons and an estimated pi of approximately 5.44. 
Analysis of the full-length PRO710 sequence shown in Figure 27 (SEQ ID NO:41) evidences the presence of 
the following: a signal peptide from about amino acid 1 to about amino acid 32, a transmembrane domain from 
about amino acid 454 to about amino acid 476, an aminoacyl-transfer RNA synthetase class-II signature sequence 
from about amino acid 6 to about amino acid 26 and potential N-glycosylation sites from about amino acid 1 1 1 

20 to about amino acid 114, from about amino acid 146 to about amino acid 149 and from about amino acid 292 
to about amino acid 295. Clone DNA44161-1434 has been deposited with ATCC on May 27, 1998 and is 
assigned ATCC deposit no. 209907. 

Analysis of the amino acid sequence of the full-length PRO710 polypeptide suggests that it possesses 
significant sequence similarity to the CDC45 protein, thereby indicating that PRO710 may be a novel CDC45 

25 homolog. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced 
significant homology between the PRO710 amino acid sequence and the following Dayhoff sequences, 
HSAJ3728J, CEF34D10J, S64939, UMU50276J, TRHYSHEEP, CELT14E8J, RNA1_YEAST, 
LVU89340J, HSU80736J and CEZK337_2. 

30 EXAMPLE 15: Isolation of cDNA clones Encoding Human PROl 151 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA40665. Based on the DNA40665 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 

35 PROl 151. 
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PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5 * -CC AG ACGCTGCTCTTCGAAAGGGTC-3 * (SEQ ID NO:48) 
reverse PCR primer 5 '-GGTCCCCGTAGGCCAGGTCC AGC-3 ' (SEQ ID NO:49) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA40665 
sequence which had the following nucleotide sequence 
5 hybridization probe 

5 ' -CTACTTCTTC AGCCTC AATGTGC AC AGCTGG AATT AC A AGG AG ACGT ACG-3 ' (SEQ ID NO:50) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PROl 15 1 gene using the probe oligonucleotide and one of the PCR primers. RNA 
10 for construction of the cDNA libraries was isolated from human fetal kidney tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PROl 151 (designated herein as DNA44694-1500 [Figure 29, SEQ ID NO:46]; and the derived protein sequence 
for PROl 151. 

The entire nucleotide sequence of DNA44694-1500 is shown in Figure 29 (SEQ ID NO:46). Clone 
15 DNA44694-1500 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 272-274 and ending at the stop codon at nucleotide positions 1049-1051 (Figure 29). The predicted 
polypeptide precursor is 259 amino acids long (Figure 30). The full-length PRO 1151 protein shown in Figure 
30 has an estimated molecular weight of about 28,770 daltons and a pi of about 6. 12. Analysis of the full-length 
PROl 151 sequence shown in Figure 30 (SEQ ID NO:47) evidences the presence of the following: a signal 
20 peptide from about amino acid 1 to about amino acid 20, a potential N-glycosylation site from about amino acid 
72 to about amino acid 75 and amino acid sequence blocks having homology to Clq domain-containing proteins 
from about amino acid 144 to about amino acid 178, from about amino acid 78 to about amino acid 1 1 1 and from 
about amino acid 84 to about amino acid 117. Clone UNQ581 (DNA44694-1500) has been deposited with 
ATCC on August 11, 1998 and is assigned ATCC deposit no. 203114. 
25 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 

alignment analysis of the full-length sequence shown in Figure 30 (SEQ ID NO:47), evidenced significant 
homology between the PROl 15 1 amino acid sequence and the following Dayhoff sequences: ACR3_HUMAN, 
HP25JTAMAS, HUMC1QB2J, PJ199306, CA1F_HUMAN, JX0369, CA24_HUMAN, S32436, P_R28916 
and CA54_HUMAN. 

30 

EXAMPLE 16 : Isolation of cDNA clones Encoding Human PRO 1282 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is designated herein as DNA33778. Based on theDNA33778 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
35 the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PR01282. 
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PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5TCTTCAGCCGCTTGCGCAACCTC3' (SEQ ID NO:53); and 
reverse PCR primer 5 > TTGCTCACATCCAGCTCCTGCAGG3* (SEQ ID NO:54). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
DNA33778 sequence which had the following nucleotide sequence: 
5 hybridization probe 

5TGGATGTTGTCCAGACAACCAGCTGGAGCTGTATCCGAGGC3' (SEQ ID NO:55). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR01282 gene using the probe oligonucleotide and one of the PCR primers. RNA 
10 for construction of the cDNA libraries was isolated from human fetal liver. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR01282 (designated herein as DNA45495-1550 [Figure 3 1 , SEQ ID NO:51] ; and the derived protein sequence 
for PR01282. 

The entire coding sequence of PR01282 is shown in Figure 3 1 (SEQ ID NO:5 1). Clone DNA45495- 
15 1550 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 
120-122, and an apparent stop codon at nucleotide positions 2139-2141 (SEQ ID NO:51). The predicted 
polypeptide precursor is 673 amino acids long. The signal peptide is at about amino acids 1-23; the 
transmembrane domain is at about amino acids 579-599; an EGF-like domain cysteine pattern signature starts 
at about amino acid 430; and leucine zipper patterns start at about amino acids 197 and 269 of SEQ ID NO:52, 
20 see Figure 32. Clone DNA45495-1550 has been deposited with the ATCC and is assigned ATCC deposit no. 
203156. The full-length PR01282 protein shown in Figure 32 has an estimated molecular weight of about 
71,655 daltons and a pi of about 7.8. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 32 (SEQ ID NO:52), revealed sequence identity 
25 between the PRO 1282 amino acid sequence and the following Dayhoff sequences (data from database 
incorporated by reference): AB007876_1, RNPLGPVJ, MUSLRRP1, ALS_PAPPA, AC004142_1, 
ALS_HUMAN, AB014462 1, DMTARTANJ, HSCHON03J and S46224. 

EXAMPLE 17 : Isolation of cDNA clones Encoding Human PRQ358 
30 Using the method described in Example 1 above, a single EST sequence was identified in the Incyte 

database, designated herein as INC3115949. Based on the INC31 15949 EST sequence, oligonucleotides were 

synthesized to identify by PCR a cDNA library that contained the sequence of interest and for use as probes to 

isolate a clone of the full-length coding sequence for PR0358. 

A pair of PCR primers (forward and reverse) were synthesized: 
35 forward PCR primer 5'-TCCCACCAGGTATCATAAArTGA A.V (SEQ ID NO:58) 

reverse PCR primer 5 ' -TTATAGAC A ATCTGTTCTC ATC AG A fi A - V (SEQ ID NO:59) 
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A probe was also synthesized: 

5 ' - AA A A AGC ATACTTGGA ATGGCCC AAGG ATAGGTGT AAATG-3 ' (SEQ ID NO:60) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0358 gene using the probe oligonucleotide and one of the PCR primers. RNA 
5 for construction of the cDNA libraries was isolated from human bone marrow (LIB256). The cDNA libraries 
used to isolated the cDNA clones were constructed by standard methods using commercially available reagents 
such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT staining a NotI site, 
linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately by gel electrophoresis, 
and cloned in a defined orientation into a suitable cloning vector (such as pRKB or pRKD; pRK5B is a precursor 
10 of pRK5D that does not contain the Sfil site; see, Holmes et al., Science . 253:1278-1280 (1991)) in the unique 
Xhol and NotI sites. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR0358 (Figure 33, SEQ ID NO:56) and the derived protein sequence for PR0358 (Figures 34, SEQ ID 
NO:57). 

15 The entire nucleotide sequence of the clone identified (DNA4736M 154) is shown in Figure 33 (SEQ 

ID NO:56). Clone DNA47361-1154 contains a single open reading frame with an apparent translational 
initiation site (ATG start signal) at nucleotide positions underlined in Figure 33. The predicted polypeptide 
precursor is 811 amino acids long, including a putative signal sequence (amino acids 1 to 19), an extracellular 
domain (amino acids 20 to 575, including leucine rich repeats in the region from position 55 to position 575), 

20 a putative transmembrane domain (amino acids 576 to 595). Clone DNA47361-1249 has been deposited with 
ATCC and is assigned ATCC deposit no. 209431. 

EXAMPLE 18 : Isolation of cDNA clones Encoding Human PRO1310 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
25 in Example 1 above. This consensus sequence is designated herein as DNA37164. Based on the DNA37164 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PRO1310. 

PCR primers (forward and reverse) were synthesized: 
30 forward PCR primer: S'GTTCTCAATGAGCTACCCGTCCra' (SEQ ID NO:63) and 
reverse PCR primer^'CCCGATrTTAnTnnA ACTCGGGCTC3' (SEQ ID NO:64). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
DNA47394 sequence which had the following nucleotide sequence: 
hybridization probe: 

35 5 ' ATCCGCATAAACCCTCAGTCCTGGTTTGATAATGGGAGCATCTGC ATGAG3 ' (SEQ ID NO:65). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
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isolate clones encoding the PRO1310 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal liver tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PRO1310 and the derived protein sequence for PRO1310. 

The entire coding sequence of PRO1310 is shown in Figures 35A-B (SEQ ID NO:61). Clone 
5 DNA47394-1572 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 326-328, and an apparent stop codon at nucleotide positions 2594-2596 (SEQ ID NO:61). The 
predicted polypeptide precursor is 765 amino acids long. The signal peptide is at about amino acids 1-25 of SEQ 
ID NO:62. Clone DNA47394-1572 has been deposited with ATCC and is assigned ATCC deposit no. 203109. 
The full-length PRO1310 protein shown in Figure 36 has an estimated molecular weight of about 85,898 daltons 
10 and a pi of about 6.87. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 36 (SEQ ID NO:62), revealed sequence identity 
between the PRO1310 amino acid sequence and the following Dayhoff sequences: AF017639 1, P_W36817, 
JC5256, CBPHHUMAN, MMU23184J, CBPN_HUMAN, HSU834I1J, CEF01D4_7, RNU62897 1 and 
15 P_W11851. 

EXAMPLE 19 : Isolation of cDNA Clones Encoding Human PRQ698 

A yeast screening assay was employed to identify cDN A clones that encoded potential secreted proteins. 

Use of this yeast screening assay allowed identification of a single cDNA clone whose sequence (herein 
20 designated as DNA39906) is shown in Figure 39 (SEQ ID NO:68). Based on the DNA39906 sequence shown 

in Figure 39, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the 

sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0698. 

In order to screen several libraries for a full-length clone, DNA from the libraries was screened by PCR 

amplification, as per Ausubel et ah, Current Protocols in Molecular Biology, with the PCR primer pair. A 
25 positive library was then used to isolate clones encoding the gene of interest using the probe oligonucleotide and 

one of the primer pairs. 

PCR primers (forward and reverse) were synthesized: 

forward PCR primer S'-AGCTGTGGTCATGGTGGTGTGGTGO' (SEQ ID NO:69) 

reverse PCR primer 5 * -CTACCTTGGCC AT AGGTG ATCCGC-3 ' (SEQ ID NO:70) 
30 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA39906 

sequence which had the following nucleotide sequence 

hybridization probe 

5 * -C ATCAGC AAACCGTCTGTGGTTC AGCTCA ACTGG AG AGGGTT-3 ' (SEQ ID NO:71) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
35 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0698 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human bone marrow tissue (LIB255). The cDNA 
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libraries used to isolate the cDNA clones were constructed by standard methods using commercially available 
reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT containing a NotI 
site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately by gel 
electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or pRKD; 
pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al., Science . 253:1278-1280 
5 (1991)) in the unique Xhol and NotI sites. 

A full length clone was identified that contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 14-16 and ending at the stopcodon found at nucleotide positions 
1544-1546 (Figure 37, SEQ ID NO:66). The predicted polypeptide precursor is 510 amino acids long, has a 
calculated molecular weight of approximately 57,280 daltons and an estimated pi of approximately 5.61. 

10 Analysis of the full-length PR0698 sequence shown in Figure 38 (SEQ ID NO:67) evidences the presence of 
the following: a signal peptide from about amino acid 1 to about amino acid 20, potential N-glycosylation sites 
from about amino acid 72 to about amino acid 75, from about amino acid 136 to about amino acid 139, from 
about amino acid 193 to about amino acid 196, from about amino acid 253 to about amino acid 256, from about 
amino acid 352 to about amino acid 355 and from about amino acid 41 1 to about amino acid 414 an amino acid 

15 block having homology to legume lectin beta-chain proteins from about amino acid 20 to about amino acid 39 
and an amino acid block having homology to the HBGF/FGF family of proteins from about amino acid 338 to 
about amino acid 365. Clone DNA48320-1433 has been deposited with ATCC on May 27, 1998 and is assigned 
ATCC deposit no. 209904. 

Analysis of the amino acid sequence of the full-length PR0698 polypeptide suggests that it possesses 

20 significant sequence similarity to the olfactomedin protein, thereby indicating that PR0698 may be a novel 
olfactomedin homolog. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
evidenced significant homology between the PR0698 amino acid sequence and the following Dayhoff sequences, 
OLFMJIANCA, 173637, AB006686S3J, RNU78105J, RNU72487J, P_R98225, CELC48E7_4, 
CEF11C33, XLU85970_1 and S42257. 

25 

EXAMPLE 20 : Isolation of cDNA Clones Encoding Human PRQ732 

A yeast screening assay was employed to identify cDNA clones that encoded potential secreted proteins. 
Use of this yeast screening assay allowed identification of a single cDNA clone whose sequence (herein 
designated as DNA42580) is shown in Figure 45 (SEQ ID NO:77). The DNA42580 sequence was then 

30 compared to a variety of known EST sequences to identify homologies. The EST databases employed included 
public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte 
Pharmaceuticals, Palo Alto, CA). The search was performed using the computer program BLAST or BLAST2 
(Altshul et al., Methods in Enzvmologv 266:460-480 (1996)) as a comparison to a 6 frame translation of the EST 
sequence. Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not 

35 encode known proteins were clustered and assembled into consensus DNA sequences with the program "phrap" 
(Phil Green, University of Washington, Seattle, Washington). 
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Using the above analysis, a consensus DNA sequence was assembled relative to other EST sequences 
using phrap. This consensus sequence is herein designated consenOl . Proprietary Genentech EST sequences 
were employed in the consensus assembly and they are herein designated DNA20239 (Figure 42; SEQ ID 
NO:74), DNA38050 (Figure 43; SEQ ID NO:75) and DNA40683 (Figure 44; SEQ ID NO:76). 

Based on the consenOl sequence, oligonucleotides were synthesized: 1) to identify by PGR a cDNA 
5 library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding 
sequence for PR0732. Forward and reverse PCR primers generally range from 20 to 30 nucleotides and are 
often designed to give a PCR product of about 100-1000 bp in length. The probe sequences are typically 40-55 
bp in length. In some cases, additional oligonucleotides are synthesized when the consensus sequence is greater 
than about l-1.5kbp. In order to screen several libraries for a full-length clone, DNA from the libraries was 

10 screened by PCR amplification, as per Ausubel et ah, Current Protocols in Molecular Biolopv. with the PCR 
primer pair. A positive library was then used to isolate clones encoding the gene of interest using the probe 
oligonucleotide and one of the primer pairs. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5 ' - ATGTTTGTGTGG A AGTGCr'Pr'r;-'* ' (SEQ ID NO:78) 

15 forward PCR primer 5-GTCAACATGCTCCTCTGC-3' (SEQ ID NO:79) 

reverse PCR primer S'-AATCCATTGTGCArTGrAfrrTPTAOn-V (SEQ ID NO:80) 

reverse PCR primer 5 ' -G AGC ATGCC ACC A fITG G A rrn A r - v (SEQ ID NO:81) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA44143 

sequence which had the following nucleotide sequence 

20 hybridization probe 

S^GCCGATGCTGTCCTAGTGGAAACAACTCCACTGTAACTAGATTGATCTATGCACO' (SEQ ID 
NO:82) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pairs identified above. A positive library was then used 

25 to isolate clones encoding the PR0732 gene using the probe oligonucleotide and one of the PCR primers. 

RNA for construction of the cDNA libraries was isolated from human fetal lung tissue (LIB26). The 
cDNA libraries used to isolate the cDNA clones were constructed by standard methods using commercially 
available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT 
containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately 

30 by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or 
pRKD; pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al., Science . 
253:1278-1280 (1991)) in the unique Xhol and NotI sites. 

A full length clone was identified that contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 88-90 and ending at the stop codon found at nucleotide positions 

35 1447-1449 (Figure 40, SEQ ID NO:72). The predicted polypeptide precursor is 453 amino acids long, has a 
calculated molecular weight of approximately 50,419 daltons and an estimated pi of approximately 5.78. 
Analysis of the full-length PR0732 sequence shown in Figure 41 (SEQ ID NO: 73) evidences the presence of 
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the following: a signal peptide from about amino acid 1 to about amino acid 28, transmembrane domains from 
about amino acid 37 to about amino acid 57, from about amino acid 93 to about amino acid 109, from about 
amino acid 126 to about amino acid 148, from about amino acid 151 to about amino acid 172, from about amino 
acid 197 to about amino acid 215, from about amino acid 231 to about amino acid 245, from about amino acid 
260 to about amino acid 279, from about amino acid 315 to about amino acid 333, from about amino acid 384 
5 to about amino acid 403 and from about amino acid 422 to about amino acid 447, potential N-glycosylation sites 
from about amino acid 33 to about amino acid 36, from about amino acid 34 to about amino acid 37, from about 
amino acid 179 to about amino acid 183, from about amino acid 298 to about amino acid 301, from about amino 
acid 337 to about amino acid 340 and from about amino acid 406 to about amino acid 409, an amino acid block 
having homology to the MIP family of proteins from about amino acid 1 19 to about amino acid 149 and an 

10 amino acid block having homology to DNA/RN A non-specific endonuclease proteins from about amino acid 279 
to about amino acid 286. Clone DNA48334-1435 has been deposited with ATCC on June 2, 1998 and is 
assigned ATCC deposit no. 209924. 

Analysis of the amino acid sequence of the full-length PR0732 polypeptide suggests that it possesses 
significant sequence similarity to the Diff33 protein, thereby indicating that PR0732 may be a novel Diff33 

15 homolog. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced 
significant homology between the PR0732 amino acid sequence and the following Dayhoff sequences, 
HS179M20_2, MUSTETUJ, CER11H62, RATDRPl, S51256. E69226, AE000869J, JC4120, 
CYB PARTE and PJR50619. 

20 EXAMPLE 21: Isolation of cDNA clones Encoding Human PROl 120 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 

in Example 1 above. This consensus sequence is designated herein consen0352. The consen0352 sequence was 

then extended using repeated cycles of BLAST and phrap to extend the consensus sequence as far as possible 

using the sources of EST sequences discussed above. The extended consensus sequence is designated herein as 
25 DNA34365. Based on the DNA34365 consensus sequence, oligonucleotides were synthesized: 1) to identify 

by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the 

full-length coding sequence for PROl 120. 

PCR primers (forward and reverse) were synthesized: 

forward PCR primers : 5 '-GAAGCCGGCTGTCTGAATC-3 ' (SEQ ID NO:85), 
30 S'-GGCCAGCTATCTCCGCAGO' (SEQ ID NO:86), 5 '-AAGGGCCTGC AAGAGAAG-3 1 (SEQ ID NO:87), 

5 * -C ACTGGGAC AACTGTGGG-3 ' (SEQ ID NO:88), 

5 ' -C AG AGGC A ACGTGG AG AG-3 ' (SEQ ID NO:89), and 

5 , -AAGTATTGTCATACAGTGTTC-3' (SEQ ID NO:90); 

reverse PCR primers : 5 ' -T AGT ACTTGGGC ACG AGGTTGG AG-3 ' (SEQ ID NO:9l), and 5*- 
35 TC ATACC AACTGCTGGTC ATTGGC-3 ' (SEQ ID NO:92). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA34365 
consensus sequence which had the following nucleotide sequence: 

394 



WO 99/63088 



PCT/US99/12252 



hybridization probe: 

5 ' -CTC AAGCTGCTGG AC ACGGAGCGGCCGGTG AATCGGTTTC ACTTG -3 ' (SEQ ID NO:93). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pairs identified above. A positive library was then used 
to isolate clones encoding the PROl 120 gene using the probe oligonucleotide and one of the PCR primers. RNA 
5 for construction of the cDNA libraries was isolated from human fetal kidney tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PROl 120 (designated herein as DNA48606-1479 [Figures 46A-B, SEQ ID NO:83]; and the derived protein 
sequence for PROl 120. 

The entire coding sequence of PROl 120 is shown in Figures 46A-B (SEQ ID NO:83). Clone 
10 DNA48606-1479 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 608-610 and an apparent stop codon at nucleotide positions 3209-3211. The predicted polypeptide 
precursor is 867 amino acids long. The full-length PROl 120 protein shown in Figure 47 has an estimated 
molecular weight of about 100,156 Daltons and a pi of about 9.44. Additional features of the PROl 120 
polypeptide include a signal peptide at about amino acids 1-17; a sulfatase signature at about amino acids 86-98; 
15 regions of homology to sulfatases at about amino acids 87-106, 133-146, 216-229, 291-320, and 365-375; and 
potential N-glycosylation sites at about amino acids 65-68, 112-115, 132-135, 149-152, 171-174, 198-201,241- 
245, 561-564, 608-61 1 , 717-720, 754-757, and 764-767. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 47 (SEQ ID NO: 84), revealed significant 
20 homology between the PRO1120 amino acid sequence and the following Dayhoff sequences: CELK09C4_1, 
GL6S^HUMAN,G65169,NCU89492_1,BCU44852^1,E64903, P_R51355, STSJHUMAN, GA6S HUMAN, 
and IDSJ40USE. Clone DNA48606-1479 was deposited with the ATCC on July 1, 1998, and is assigned 
ATCC deposit no. 203040. 

25 EXAMPLE 22 : Isolation of cDNA clones Encoding Human PRQ537 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated as Incyte EST cluster no. 29605. This EST cluster 
sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST 
databases (e.g. , GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, 

30 CA) to identify existing homologies. The homology search was performed using the computer program BLAST 
or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA48350. 

35 In light of an observed sequence homology between the DNA48350 consensus sequence and an EST 

sequence encompassed within the Merck EST clone no. R63443, the Merck EST clone R63443 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a frill-length protein. 
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The sequence of this cDNA insert is shown in Figure 48 and is herein designated as DNA49141-1431. 

Clone DNA49 14 1-1431 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 97-99 and ending at the stop codon at nucleotide positions 442-444 (Figure 48). The 
predicted polypeptide precursor is 115 amino acids long (Figure 49). The full-length PR0537 protein shown 
in Figure 49 has an estimated molecular weight of about 13, 183 daltons and a pi of about 12. 13. Analysis of 
5 the full-length PR0537 sequence shown in Figure 49 (SEQ ID NO:95) evidences the presence of the following: 
a signal peptide from about amino acid 1 to about amino acid 31, a potential N-glycosylation site from about 
amino acid 44 to about amino acid 47, potential N-myristolation sites from about amino acid 3 to about amino 
acid 8 and from about amino acid 16 to about amino acid 21 and an amino acid block having homology to 
multicopper oxidase proteins from about amino acid 97 to about amino acid 105. Clone DNA49141-1431 has 

10 been deposited with ATCC on June 23, 1998 and is assigned ATCC deposit no. 203003. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 49 (SEQ ID NO:95), evidenced homology 
between the PR0537 amino acid sequence and the following Dayhoff sequences: A54523, CELF22H10_2, 
FKH4_MOUSE,OTX1_HUMAN,URB1_USTMA,KNOB_PLAFN,A32895^1,AF036332_1,HRG^HUMAN 

15 and HRP3_PLAFS. 

EXAMPLE 23 : Isolation of cDNA clones Encoding Human PRQ536 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated herein as ss.clu2437.init. This EST cluster sequence was 

20 then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to 
identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 

25 assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA48351. 

In light of an observed sequence homology between the DNA48351 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. H 1 1 129, the Merck EST clone HI 1 129 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

30 The sequence of this cDNA insert is shown in Figure 50 and is herein designated as DNA49 142- 1430. 

Clone DNA49142-1430 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 48-50 and ending at the stop codon at nucleotide positions 987-989 (Figure 50). The 
predicted polypeptide precursor is 313 amino acids long (Figure 51). The full-length PR0536 protein shown 
in Figure 51 has an estimated molecular weight of about 34, 189 daltons and a pi of about 4.8. Analysis of the 

35 full-length PR0536 sequence shown in Figure 51 (SEQ ID NO:97) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 25, a potential N-glycosylation site from about amino 
acid 45 to about amino acid 48 and an amino acid sequence block having homology to sulfatase proteins from 
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about amino acid 16 to about amino acid 26. Clone DNA49142-1430 has been deposited with ATCC on June 
23, 1998 and is assigned ATCC deposit no. 203002. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 51 (SEQ ID NO: 97), evidenced homology 
between the PR0536 amino acid sequence and the following Dayhoff sequences: APU46857_1, PK2 DICDI, 
5 H64743, F5I14J8, CEAM_ECOLI, GEN14267, H64965, TCU39815J, PSBJ_ODOSI and P_R06980. 

EXAMPLE 24: Isolation of cDNA clones Encoding Human PRQ535 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated herein as ss.clul2694.init. This EST cluster sequence was 

10 then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to 
identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al., Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 

15 assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA48352. Two 
propietary Genentech EST sequences were employed in the assembly are are herein shown in Figures 54 and 
55. 

In light of an observed sequence homology between the DNA48352 consensus sequence and an EST 
20 sequence encompassed within the Merck EST clone no. H86994, the Merck EST clone H86994 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 52 and is herein designated as DNA49 143- 1429. 

Clone DNA49 143- 1429 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 78-80 and ending at the stop codon at nucleotide positions 681-683 (Figure 52). The 
25 predicted polypeptide precursor is 201 amino acids long (Figure 53). The full-length PR0535 protein shown 
in Figure 53 has an estimated molecular weight of about 22,180 daltons and a pi of about 9.68. Analysis of the 
full-length PR0535 sequence shown in Figure 53 (SEQ ID NO:99) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 25, a transmembrane domain from about amino acid 
155 to about amino acid 174, a potential N-glycosylation site from about amino acid 196 to about amino acid 
30 199 and FKBP-type peptidyl-prolyl cis-trans isomer signature sequences from about amino acid 62 to about 
amino acid 77, from about amino acid 87 to about amino acid 123 and from about amino acid 128 to about amino 
acid 141. Clone DNA49143-1429 has been deposited with ATCC on June 23, 1998 and is assigned ATCC 
deposit no. 203013. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST- sequence 
35 alignment analysis of the full-length sequence shown in Figure 53 (SEQ ID NO:99), evidenced homology 
between the PR0535 amino acid sequence and the following Dayhoff sequences: S71237, PR93551, PR28980, 
S71238, FKB2 HUMAN, CELC05C8J, S55383, S72485, CELC50F2_6 and S75144. 
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EXAMPLE 25 : Isolation of cDNA clones Encoding Human PRQ718 

A cDNA sequence isolated in the amylase screen described in Example 2 (human fetal lung library) 
above is herein designated DNA43512 (see Figure 62; SEQ ID NO: 108). The DNA43512 sequence was then 
compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., 
GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to 
5 identity existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al., Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into consensus DNA sequences with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA45625. Proprietary 

10 Genentech EST sequences were employed in the assembly and are herein shown in Figures 58-61 , 

Based on the DNA45625 sequence, oligonucleotide probes were generated and used to screen a human 
fetal lung library (LIB25) prepared as described in paragraph 1 of Example 2 above. The cloning vector was 
pRK5B (pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al., Science , 
253:1278-1280 (1991)), and the cDNA size cut was less than 2800 bp. 

15 PCR primers (forward and reverse) were synthesized: 

forward PCR primer 5 ' -GGGTGG ATGGT ACTGCTGC ATCC-3 ' (SEQ ID NO: 109) 
reverse PCR primer 5 ' -TGTTGTGCTGTGGGAAATC AG ATGTG-3 1 (SEQ ID NO: 1 10) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA45625 sequence 
which had the following nucleotide sequence: 

20 hybridization probe 

5 ' -GTGTCTGG AGGCTGTGGCCGTTTTGTTTTCTTGGGCT A A AATCGGG-3 ' (SEQ ID NO: 111) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0718 gene using the probe oligonucleotide and one of the PCR primers. 

25 A full length clone was identified that contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 36-38 and ending at the stop codon found at nucleotide positions 
607-609 (Figure 56; SEQ ID NO: 102). The predicted polypeptide precursor is 157 amino acids long, has a 
calculated molecular weight of approximately 17,400 daltons and an estimated pi of approximately 5.78. 
Analysis of the full-length PR0718 sequence shown in Figure 57 (SEQ ID NO: 103) evidences the presence of 

30 the following: a type II transmembrane domain from about amino acid 21 to about amino acid 40, and other 
transmembrane domains at about amino acid 58 to about amino acid 78, about amino acid 95 to about amino acid 
1 14, and about amino acid 127 to about amino acid 147; a cell attachment sequence from about amino acid 79 
to about amino acid 81; and a potential N-glycosylation site from about amino acid 53 to about amino acid 56. 
Clone DNA49647- 1398 has been deposited with ATCC on June 2, 1998 and is assigned ATCC deposit no. 

35 209919. 

Analysis of the amino acid sequence of the full-length PR0718 polypeptide suggests that it possesses 
no significant sequence similarity to any known protein. However, an analysis of the Dayhoff database (version 
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35.45 SwissProt 35) evidenced some degree of homology between the PR0718 amino acid sequence and the 
following Dayhoff sequences: AF045606J, AF039906J, SPBC8D2_2, S63441, F64728, COX1JTRYBB, 
F64375, E64173, RPYGJTJ3, MTCY261_23. 

EXAMPLE 26 : Isolation of cDNA clones Encoding Human PRQ872 
5 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

Incyte EST sequence designated herein as clul 20709. ink. The clul20709.init sequence was then compared a 

proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 

homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 

et al., Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
10 in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 

DNA sequence with the program M phrap" (Phil Green, University of Washington, Seattle, Washington). The 

consensus sequence obtained therefrom is herein designated DNA48254. 

In light of an observed sequence homology between the DNA48254 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3438068, the Incyte EST clone 3438068 was purchased 
15 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

The sequence of this cDNA insert is shown in Figure 63 and is the full-length DNA sequence for PR0872. 

Clone DNA49819-1439 was deposited with the ATCC on June 2, 1998, and is assigned ATCC deposit no. 

209931. 

The entire nucleotide sequence of DNA49819-1439 is shown in Figure 63 (SEQ ID NO: 1 12). Clone 
20 DNA49819-1439 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 14-16 and ending at the stop codon at nucleotide positions 1844-1846 (Figure 63). The predicted 
polypeptide precursor is 610 amino acids long (Figure 64). The full-length PR0872 protein shown in Figure 
64 has an estimated molecular weight of about 66,820 daltons and a pi of about 8.65. Analysis of the full-length 
PR0872 sequence shown in Figure 64 (SEQ ID NO: 113) evidences the presence of the following features: a 
25 signal peptide at amino acid 1 to about 18, putative transmembrane domains at about amino acids 70-87, 200-222 
and 568-588; sequence identity with bacterial-type phytoene dehydrogenase protein at about amino acids 7 1 - 105 ; 
sequence identity with a regulator of chromosome condensation (RCC1) signature 2 at about amino acids 201- 
211; leucine zipper patterns at about amino acids 214-235, 221-242, 228-249 and 364-385; a potential N- 
glycosylation site at about amino acids 271-274; and a glycosaminoglycan attachment site at about amino acids 
30 75-78. Analysis of the amino acid sequence of the full-length PR0872 polypeptide using the Dayhoff database 
(version 35.45 SwissProt 35) evidenced homology between the PR0872 amino acid sequence and the following 
Dayhoff sequences: PRCRTIJ, S75951, S74689, CELF37C4J, CRTI_RHOCA, S76617, YNI2_METTL, 
MTV014J4, AOFB HUMAN, and MMU70429J. 

35 EXAMPLE 27: Isolation of cD NA clones Encoding Human PRO1063 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence designated herein as ss.clull9743.init. The Incyte EST cluster sequence 

399 



WO 99/63088 



PCT/US99/12252 



ss.clul 19743. ink sequence was then compared to a variety of expressed sequence tag (EST) databases which 
included public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte 
Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using 
the computer program BLAST or BLAST2(Altshuletal., Methods in Enzvmolopy 266:460-480 (1996)), Those 
comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known 
5 proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, 
University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein 
designated DNA48288. 

In light of an observed sequence homology between the DNA48288 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2783726, the Incyte EST clone 2783726 was purchased 

10 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 65 and is herein designated DNA49820-1427. 

The full length clone shown in Figure 65 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 90-92 and ending at the stop codon found at nucleotide positions 
993-995 (Figure 65; SEQ ID NO:l 14). The predicted polypeptide precursor is 301 amino acids long, has a 

15 calculated molecular weight of approximately 33,530 daltons and an estimated pi of approximately 4.80. 
Analysis of the full-length PRO1063 sequence shown in Figure 66 (SEQ ID NO: 1 15) evidences the presence of 
the following: a signal peptide from about amino acid 1 to about amino acid 21, potential N-glycosylation sites 
from about amino acid 195 to about amino acid 198, from about amino acid 217 to about amino acid 220 and 
from about amino acid 272 to about amino acid 275, a glycosaminoglycan attachment site from about amino acid 

20 267 to about amino acid 270, a microbodies C-terminal targeting signal site from about amino acid 299 to about 
amino acid 301, a type II fibronectin collagen-binding domain homology sequence from about amino acid 127 
to about amino acid 168 and a fructose-bisphosphate aldolase class II protein homology sequence from about 
amino acid 101 to about amino acid 118. Clone DNA49820-1427 has been deposited with the ATCC on June 
2, 1998 and is assigned ATCC deposit no. 209932. 

25 Analysis of the amino acid sequence of the full-length PRO 1063 polypeptide suggests that it possesses 

sequence similarity to the human type IV collagenase protein. More specifically, an analysis of the Dayhoff 
database (version 35.45 SwissProt 35) evidenced some degree of homology between the PRO1063 amino acid 
sequence and the following Dayhoff sequences, S68303, CFU68533 1, PP91139, RNU65656 1, 
PA2R_RABIT, MMU56734 1, FINCXENLA, A48925, P R92778 and FA12HUMAN. 

30 

EXAMPLE 28 : Isolation of cDNA clones Encoding Human PRQ619 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated herein as 88434. This EST cluster sequence was then 
compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., 
35 GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify 
existing homologies. The homology search was performed using the computer program BLAST or BLAST2 
(Altshul et ah, Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score 
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of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a 
consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, 
Washington). 

In light of an observed sequence homology between the consensus sequence and an EST sequence 
encompassed within the Incyte EST clone no. 1656694, the Incyte EST clone 1656694 was purchased and the 
5 cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 67 and is herein designated as DNA4982 1-1562. 

The full length clone shown in Figure 67 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 81-83 and ending at the stop codon found at nucleotide positions 
450-452 (Figure 67; SEQ ID NO: 1 16). The predicted polypeptide precursor (Figure 68, SEQ ID NO: 1 17) is 
10 123 amino acids long including a predicted signal peptide at about amino acids 1-20. PR0619 has a calculated 
molecular weight of approximately 13,710 daltons and an estimated pi of approximately 5.19. Clone 
DNA49821-1562 was deposited with the ATCC on June 16, 1998 and is assigned ATCC deposit no. 209981. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 68 (SEQ ID NO: 117), revealed significant 
15 homology between the PR0619 amino acid sequence and the following Dayhoff sequences: S35302, D87009_l, 
HSU93494J, HUMIGLAM5J, D86999_2, HUMIGLYMI J, HUMIGLYMKEJ, A29491J, A29498J, 
and VPR2_MOUSE. 

EXAMPLE 29 : Isolation of cDN A clones Encoding Human PRQ943 

20 A consensus DNA sequence encoding PR0943 was assembled relative to other EST sequences using 

phrap as described in Example I above. This consensus sequence was then extended using repeated cycles of 
BLAST and phrap to extend the consensus sequence as far as possible using the sources of EST sequences 
discussed above. The extended consensus sequence is herein designated DNA36360. Based on the DNA36360 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 

25 the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PR0943. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5'-CGAGATGACGCCGAGCCCCC-3' (SEQ ID NO: 120) 
reverse PCR primer 5 ' -CGGTTCG AC ACGCGGC AGGTG-3 ' (SEQ ID NO: 121) 
30 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA36360 
sequence which had the following nucleotide sequence 
hybridization probe 

5 '-TGCTGCTCCTGCTGCCGCCGCTGCTGCTGGGGGCCTTCCCGCCGG-3 ' (SEQ ID NO: 122) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
35 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0943 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal brain tissue. 
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DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR0943 (designated herein as DNA52 192- 1369 [Figure 69, SEQ ID NO: 1 18]) and the derived protein sequence 
for PR0943. 

The entire nucleotide sequence of DNA52192-1369 is shown in Figure 69 (SEQ ID NO:l 18). Clone 
DNA52 192- 1369 contains a single open reading frame with an apparent translational initiation site at nucleotide 
5 positions 150-152 and ending at the stop codon at nucleotide positions 1662-1664 (Figure 69). The predicted 
polypeptide precursor is 504 amino acids long (Figure 70). The full-length PR0943 protein shown in Figure 
70 has an estimated molecular weight of about 54,537 daltons and a pi of about 10.04. Analysis of the full- 
length PR0943 sequence shown in Figure 70 (SEQ ID NO: 119) evidences the presence of the following: a signal 
peptide from about amino acid 1 to about amino acid 17, a transmembrane domain from about amino acid 376 

10 to about amino acid 396, tyrosine kinase phosphorylation sites from about amino acid 212 to about amino acid 
219 and from about amino acid 329 to about amino acid 336, potential N-glycosylation sites from about amino 
acid 1 1 1 to about amino acid 1 14, from about amino acid 23 1 to about amino acid 234, from about amino acid 
255 to about amino acid 258 and from about amino acid 293 to about amino acid 296 and an immunoglobulin 
and MHC protein sequence homology block from about amino acid 219 to about amino acid 236. Clone 

15 DNA52192-1369 has been deposited with ATCC on July 1, 1998 and is assigned ATCC deposit no. 203042. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 70 (SEQ ID NO: 119), evidenced significant 
homology between the PR0943 amino acid sequence and the following Dayhoff sequences: B49151, A39752, 
FGR1JCENLA, S38579, RATHBFGFRBl , TVHU2F, FGR2_MOUSE, CEK3_CHICK, P_R21080 and 

20 A27171J. 

EXAMPLE 30: Isolation of cDNA clones Encoding Human PROl 188 

A consensus DNA sequence was assembled relative to other EST sequences using the program "phrap" 
as described in Example 1 above. This consensus sequence is designated herein as DNA45679. Based on the 
25 DNA45679 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that 
contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence 
for PROl 188. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5 -CTGGTGCCTCA ACAGGGAGC AG-3 ' (SEQ ID NO: 125) 
30 reverse PCR primer 5 r -CC ATTGTGCAGGTC AGGTC ACAG-3 ' (SEQ ID NO: 126) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
DNA45679 sequence which had the following nucleotide sequence: 
hybridization probe 

5 ' -CTGG AGCAAGTGCTC AGCTGCCTGTGGTC AG ACTGGGGTC-3 * (SEQ ID NO: 127) 
35 In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PROl 188 gene using the probe oligonucleotide and one of the PCR primers. RNA 
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for construction of the cDNA libraries was isolated from human fetal kidney tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR01188 (designated herein as DNA52598-1518 [Figure 71, SEQ ID NO: 123]); and the derived protein 
sequence for PROl 188. 

The entire coding sequence of PROl 188 is shown in Figure 71 (SEQ ID NO: 123). Clone DNA52598- 
5 1518 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 
136-138 and an apparent stop codon at nucleotide positions 3688-3690. The predicted polypeptide precursor 
is 1184 amino acids long. The full-length PROl 188 protein shown in Figure 72 has an estimated molecular 
weight of about 132,582 Daltons and a pi of about 8.80. Additional features include: a signal peptide at about 
amino acids 1-31; an ATP/GTP binding site motif A (P-Ioop) at about amino acids 266-273; an aldehyde 
10 dehydrogenases cysteine active site at about amino acids 1 88- 1 99; growth factor and cytokines receptors family 
signature 2 at about amino acids 153-159; and potential N-glycosylation sites at about amino acids 129-132, 132- 
135, 346-349, 420-423, 550-553, 631-634, 1000-1003, and 1056-1059. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 72 (SEQ ID NO: 124), revealed significant 
15 homology between the PR01188 amino acid sequence and the following Dayhoff sequences: SSU831141, 
S56015, CET21B6_4, CELT19D2J, and TSP1 MOUSE. 

Clone DNA52598-1518 has been deposited with ATCC and is assigned ATCC deposit no 203107. 

EXAMPLE 31 : Isolation of cDNA clones Encoding Human PROl 133 

20 A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 

in Example 1 above. This sequence was extended using repeated cycles of phrap. The extended consensus 
sequence is designated herein DNA38102. Based on the DNA38102 consensus sequence, oligonucleotides were 
synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as 
probes to isolate a clone of the full-length coding sequence for PROl 133. 

25 PCR primers (two forward and one reverse) were synthesized: 

forward PCR primer 1 5 ' -TCG ATT ATGG ACG A AC ATCiCC a Pif-^ ' (SEQ ID NO: 130); 
forward PCR primer 2 5 ' -TTCTG AG ATCCCTP ATPf'TP-'* ' (SEQ ID NO: 131); and 
reverse primer 5'-AGGTTCAGGGACAGCAAGTTTGGG-T (SEQ ID NO: 132). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 

30 DNA38102 sequence which had the following nucleotide sequence: 
hybridization probe 

5'TTTGCTGGACCTCGGCTACGGAATTGGCTTCCCTCTACGGACAGCTGGAT3' (SEQ ID NO: 133). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with a PCR primer pair identified above. A positive library was then used to 
35 isolate clones encoding the PROl 133 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal kidney tissue. 
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DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PROl 133 and the derived protein sequence for PR01133. 

The entire coding sequence of PROl 133 is shown in Figure 73 (SEQ ID NO: 128). Clone DNA53913- 
1490 contains a single open reading frame with an apparent translation^ initiation site at nucleotide positions 
266-268 and an apparent stop codon at nucleotide positions 1580-1582 of SEQ ID NO: 128. The predicted 
5 polypeptide precursor is 438 amino acids long. The signal peptide is at amino acids 1-18 of SEQ ID NO: 129. 
EGF-Iike domain cysteine pattern signatures start at 315 and 385 of SEQ ID NO: 129 as shown in Figure 74. 
Clone DNA53913-1490 has been deposited with ATCC and is assigned ATCC deposit no. 203162. The full- 
length PROl 133 protein shown in Figure 74 has an estimated molecular weight of about 49,260 daltons and a 
pi of about 6.15. 

10 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 

alignment analysis of the full-length sequence shown in Figure 74 (SEQ ID NO: 129), revealed some sequence 
identity between the PRO 1133 amino acid sequence and the following Dayhoff sequences (data from the database 
incorporated herein): AF002717J, LMG1_HUMAN, B54665, UNC6_CAEEL, LML1CAEEL, 
LMA5_MOUSE, MMU88353J, LMA1HUMAN, HSLN2C64 1 and AF005258J. 

15 

EXAMPLE 32 : Isolation of cDNA clones Encoding Human PRQ784 

An initial DNA sequence (SEQ ID NO: 136), referred to herein as DNA44661 and shown in Figure 77, 
was identified using a yeast screen, in a human fetal lung cDNA library that preferentially represents the 5' ends 
of the primary cDNA clones. DNA44661 was then compared to ESTs from public databases (e.g., GenBank), 

20 and a proprietary EST database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA), using the computer 
program BLAST or BLAST2 [Altschul et al., Methods in Enzvmologv. 266:460-480 (1996)]. The ESTs were 
then clustered and assembled into a consensus DNA sequence using the computer program "phrap" (Phil Green, 
University of Washington, Seattle, Washington). The consensus sequence obtained is designated herein as 
"DNA45463\ Based on the DNA45463 consensus sequence, oligonucleotides were synthesized for use as 

25 probes to isolate a clone of the full-length coding sequence for PR0784 from a human fetal lung cDNA library. 

The full length DNA53978-1443 clone shown in Figure 75 contained a single open reading frame with 
an apparent translational initiation site at nucleotide positions 37-39 and ending at the stop codon found at 
nucleotide positions 821-823 (Figure 75; SEQ ID NO: 134). The predicted polypeptide precursor (Figure 76, 
SEQ ID NO: 135) is 228 amino acids long. PR0784 has a calculated molecular weight of approximately 25,735 

30 Daltons and an estimated pi of approximately 5 .45 . PR0784 has the following features: a signal peptide at about 
amino acid 1 to about 15; transmembrane domains at about amino acids 68 to about 87 and at about 183 to about 
204; potential N-myristoylation sites at about amino acids 15-20, 51-56, 66-60, 163-168, and 206-21 1; and an 
RNP-i protein RNA-binding region at about amino acids 108 to about 117. 

Clone DNA53978-1443 was deposited with ATCC on June 16, 1998, and is assigned ATCC deposit 

35 no. 209983. 

Based on a BLAST and FastA sequence alignment analysis (using the ALIGN computer program) of 
the full-length sequence, PR0784 shows amino acid sequence identity to the following proteins: RNU42209J, 
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MMU91538J, CGU91742J, CELF55A4J5, SC22_YEAST, and F48188. 

EXAMPLE 33: Isolation of cDNA Clones Encoding Human PRQ783 

A yeast screening assay was employed to identify cDNA clones that encoded potential secreted proteins. 
Use of this yeast screening assay allowed identification of a single cDNA clone, designated herein as DNA45201 
5 (Figure 80; SEQ ID NO: 139). 

The DNA45201 sequence was then used to search expressed sequence tag (EST) databases for the 
presence of potential homologies. The EST databases included public EST databases (e.g., GenBank) and a 
proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA). The search was 
performed using the computer program BLAST or BLAST2 (Altshul et al. , Methods in Enzvmologv 266:460- 
10 480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not 
encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" 
(Phil Green, Univ. of Washington, Seattle, Washington). The consensus sequence obtained is herein designated 
as "consenOl". A proprietary Genentech EST sequence was used in the consensus assembly and is herein 
designated as DNA 14575 (Figure 81; SEQ ID NO: 140). 
15 Based on the consenOl sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA 

library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding 
sequence for PR0783. In order to screen several libraries for a full-length clone, DNA from the libraries was 
screened by PCR amplification, as per Ausubel et al., Current Protocols in Molecular Biolopv. with the PCR 
primer pair. A positive library was then used to isolate clones encoding the gene of interest using the probe 
20 oligonucleotide and one of the primer pairs. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5 ' -G ACTGT ATCTG AGCCCC AO ACTOr-l 1 (SEQ ID NO: 141), 
forward PCR primer 5 f -TCAGCAATG AOQTGCTGCTP-1 ' (SEQ ID NO: 142), and 
reverse PCR primer 5 '-TGAGGAAGATGAGGGAC AGnTTGCj-V (SEQ ID NO: 143). 
25 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consenOl 

sequence which had the following nucleotide sequence: 
hybridization probe 

5 ' -T ATGGAAGC ACCTG ACTACG AAGTGCTATCCGTGCG AG AAC AGCT ATTCC-3 ' (SEQ ID NO: 144). 
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

30 screened by PCR amplification with a PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0783 gene using the probe oligonucleotide and one of the PCR primers. 

RNA for construction of the cDNA libraries was isolated from human fetal kidney tissue (LIB228). 
The cDNA libraries used to isolate the cDNA clones were constructed by standard methods using commercially 
available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT 

35 containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately 
by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or 
pRKD; pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al., Science . 

405 



WO 99/63088 



PCT/US99/12252 



253 : 1278-1280 (1991)) in the unique Xhol and NotI sites. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR0783 [herein designated as DNA53996-1442] (SEQ ID NO: 137) and the derived protein sequence for 
PR0783. 

The entire nucleotide sequence of DNA53996-1442 is shown in Figure 78 (SEQ ID NO: 137). Clone 
5 DNA53996-1442 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 310-312 and ending at the stop codon at nucleotide positions 1777-1779 (Figure 78). The predicted 
polypeptide precursor is 489 amino acids long (Figure 79). The full-length PR0783 protein shown in Figure 
79 has an estimated molecular weight of about 55,219 daltons and a pi of about 8.47. Analysis of the full-length 
PR0783 sequence shown in Figure 79 (SEQ ID NO: 138) evidences the presence of the following features: 
10 transmembrane domains located at about amino adds 23-42 , 67-89, 111-135, 154-176, 194-218, 296-319, 
348-370, 387-410 and 427-452; leucine zipper patterns located at about amino acids 263-283 and 399-420; a 
potential tyrosine kinase phosphorylation site at about amino acids 180-187; potential N-glycosylation sites at 
about amino acids 105 -108 and 121-124; potential cAMP- and a cGMP-dependent protein kinase 
phosphorylation site at about amino acids 288-291; and a region having sequence identity with bacterial 
15 rhodopsins retinal binding site protein at about amino acids 190-218. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35) shows some sequence identity 
between the PR0783 amino acid sequence and the following Dayhoff sequences: YNC2CAEEL, D64048, 
ATACO02332JF4P9.3, NY2RSHEEP, and VSH_MUMPA. 

Clone DNA53996-1442 was deposited with the ATCC on June 2, 1998, and is assigned ATCC deposit 
20 no. 209921. 

EXAMPLE 34 : Isolation of cDN A Clones Encoding Human PRO820 

An expressed sequence tag (EST) DNA database (Merck/ Wash . U) was searched and an EST designated 
EST no. AA504080, Merck clone 825136, was identified (library 312, human B-cell tonsil). Homology searches 
25 revealed that this EST showed sequence identity with low affinity immunoglobulin gamma Fc receptor II. DNA 
sequencing gave the full-length DNA sequence for PRO820 and the derived protein sequence for PRO820. 

The entire nucleotide sequence of DNA56041-1416 is shown in Figure 82 (SEQ ID NO: 145). Clone 
DNA5604 1-14 16 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 115-117 and ending at the stop codon at nucleotide positions 487-489 (Figure 82). The predicted 
30 polypeptide precursor is 124 amino acids long (Figure 83). The full-length PRO820 protein shown in Figure 
83 has an estimated molecular weight of about 14,080 daltons and a pi of about 7.48. Clone DNA56041-1416 
has been deposited with ATCC. Regarding the sequence, it is understood that the deposited clone contains the 
correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Still analyzing the amino acid sequence of SEQ ID NO: 146, the putative signal peptide is at about amino 
35 acids 1-15 of SEQ ID NO: 146. Protein kinase C phosphorylation sites are at about amino acids 20-22 and 43-45 
of SEQ ID NO: 146. An N-myristoylation site is at about amino acids 89-94 of SEQ ID NO: 146. An 
immunoglobulin and major histocompatibility complex domain is at about amino acids 83-90 of SEQ ID NO: 146. 
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The corresponding nucleotides can be routinely determined given the sequences provided herein. 

EXAMPLE 35 : Isolation of cDNA Clones Encoding Human PRO 1080 

A consensus DNA sequence was assembled relative to other EST sequences using phrap and was 
extended using repeated cycles of BLAST and phrap so as to extend the consensus sequence as far as possible 
5 using the sources of the EST sequences as described in Example 1 above. The consensus sequence is designated 
herein as DNA52640. An EST proprietary to Genentech was employed in the consensus assembly and is herein 
designated as DNA36527 (Figure 86; SEQ ID NO: 149). 

In light of an observed sequence homology between the DNA36527 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. 526423, the Merck EST clone 526423 was purchased 
10 and the cDNA insert was obtained and sequenced: It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 84 and is herein designated as DNA56047-1456. 

The entire nucleotide sequence of DNA56047-1456 is shown in Figure 84 (SEQ ID NO: 147). Clone 
DNA56047-1456 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 159-161 and ending at the stop codon at nucleotide positions 1233-1235 of SEQ ID NO: 147 (Figure 
15 84) . The predicted polypeptide precursor is 358 amino acids long (Figure 85) . The full-length PRO 1080 protein 
shown in Figure 85 has an estimated molecular weight of about 40,514 daltons and a pi of about 6.08. Clone 
DNA56047-1456 has been deposited with ATCC on June 9, 1998. It is understood that the deposited clone has 
the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

20 Also shown in Figure 85 are the approximate locations of the signal peptide, cell attachment site, Nt- 

DnaJ domain signature, region having sequence identity with Nt-DnaJ domain proteins, and N-glycosylation 
sites. The corresponding nucleic acids of these amino acid sequences and others provided herein can be 
routinely determined by the information provided herein. 

25 EXAMPLE 36 : Isolation of cDNA Clones Encoding Human PRO 1079 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above, and is herein designated DNA52714. Based on information provided by the assembly, the 
clone for Merck EST no. H06898 was obtained and sequenced, thereby giving the nucleotide sequence 
designated herein as DNA56050-1455. The entire nucleotide sequence of DNA56050-1455 is shown in Figure 

30 87 (SEQ ID NO: 150). Clone DNA56050-1455 contains a single open reading frame with an apparent 
translational initiation site at nucleotide positions 183-185 and ending at the stop codon at nucleotide positions 
861-863 (Figure 87). The predicted polypeptide precursor is 226 amino acids long (Figure 88). The full-length 
PRO1079 protein shown in Figure 88 has an estimated molecular weight of about 24,61 1 Daltons and a pi of 
about 4.85. Analysis of the full-length PRO1079 sequence shown in Figure 88 (SEQ ID NO:3) evidences the 

35 presence of the following features: a signal peptide at about amino acid 1-29; potential N-myristoylation sites 
at about amino acids 10-15, and 51-56; homology to photosystem I psaG and psaK proteins at about amino acids 
2 to 20; and homology to prolyl endopeptidase family serine proteins at about amino acids 150 to 163. 
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Analysis of the amino acid sequence of the full-length PRO 1079 polypeptide using the Dayhoff database 
(version 35.45 SwissProt 35) evidenced some sequence identity between the PRO 1079 amino acid sequence and 
the following Dayhoff sequences: CEK10C3_4, MMU50734J, D69503, AF051 149J, and VSMPJTVMS. 

Clone UNQ536 (DNA56050-1455) was deposited with the ATCC on June 22, 1998, and is assigned 
ATCC deposit no. 203011. 

5 

EXAMPLE 37: Isolation of cDNA clones Encoding Human PRQ793 

A cDNA clone (DNA56 110-1437) encoding a native human PR0793 polypeptide was identified by a 
yeast screen, in a human skin tumor cDNA library that preferentially represents the 5' ends of the primary 
cDNA clones. The yeast screen employed identified a single EST clone designated herein as DNA50177 (Figure 

10 91; SEQ ID NO: 154). The DNA50177 sequence was then compared to various EST databases including public 
EST databases (e.g. , GenBank), and a proprietary EST database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, 
CA) to identify homologous EST sequences. The comparison was performed using the computer program 
BLAST or BLAST2 [Altschul et al., Methods in Enzvmology. 266:460-480 (1996)]. Those comparisons 
resulting in a BLAST score of 70 (or in some cases, 90) or greater that did not encode known proteins were 

15 clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of 
Washington, Seattle, Washington). This consensus sequence is herein designated DNA50972. 

In light of an observed sequence homology between the DNA50972 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. N33874, the Merck EST clone N33874 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

20 The sequence of this cDNA insert is shown in Figure 89 and is herein designated as DNA561 10-1437. 

The full-length DNA561 10-1437 clone shown in Figure 89 contains a single open reading frame with 
an apparent translation^ initiation site at nucleotide positions 77-79 and ending at the stop codon at nucleotide 
positions 491-493 (Figure 89). The predicted polypeptide precursor is 138 amino acids long (Figure 90). The 
full-length PR0793 protein shown in Figure 90 has an estimated molecular weight of about 15,426 daltons and 

25 a pi of about 10.67. Analysis of the full-length PR0793 sequence shown in Figure 90 (SEQ ID NO: 153) 
evidences the presence of the following: transmembrane domains from about amino acid 12 to about amino acid 
30, from about amino acid 33 to about amino acid 52, from about amino acid 69 to about amino acid 89 and 
from about amino acid 93 to about amino acid 109, potential N-myristolation sites from about amino acid 1 1 to 
about amino acid 16, from about amino acid 51 to about amino acid 56 and from about amino acid 1 16 to about 

30 amino acid 121 and an amino acid sequence block having homology to an aminoacyl-transfer RNA synthetase 
class-II protein from about amino acid 49 to about amino acid 59. Clone DNA561 10-1437 has been deposited 
with ATCC on August 11, 1998 and is assigned ATCC deposit no. 203113. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 90 (SEQ ID NO: 153), evidenced certain 

35 homology between the PR0793 amino acid sequence and the following Dayhoff sequences: S47453, 
AF015193J2, MTEHGNS9_2, E64030, H69784, D64995, CD53_MOUSE, GEN8006, AE001 138J7 and 
COX2_STRPU. 
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EXAMPLE 38 : Isolation of cDNA Clones Encoding Human PRO1016 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. The consensus sequence obtained is herein designated DNA53502. 

In light of an observed sequence homology between the DNA53502 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. 38680, the Merck EST clone 38680 was purchased and 
5 the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 92. 

The entire nucleotide sequence of DNA561 13-1378 is shown in Figure 92 (SEQ ID NO: 155). Clone 
DNA561 13-1378 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 168-170 and ending at the stop codon at nucleotide positions 1302-1304 (Figure 92). The predicted 
10 polypeptide precursor is 378 amino acids long (Figure 93). The full-length PRO1016 protein shown in Figure 
93 has an estimated molecular weight of about 44,021 daltons and a pi of about 9.07. Clone DNA561 13-1378 
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 
the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PRO 10 16 polypeptide suggests that portions of 
15 it possess sequence identity with acyltransferase, thereby indicating that PRO1016 may be a novel 
acyltransferase. 

Still analyzing the amino acid sequence of SEQ ID NO: 156, the putative signal peptide is at about amino 
acids 1-18 of SEQ ID NO: 156. The transmembrane domain(s) are at about amino acids 332-352 and 305-330 
of SEQ ID NO: 156. The fructose-bisphosphate aldolase class-II protein homology sequence is at about amino 
20 acids 73-90 of SEQ ID NO: 156. The extradiol ring-cleavage dioxygenase protein is at about amino acids 252- 
275 of SEQ ID NO: 156. The corresponding nucleotides can be routinely determined given the sequences 
provided herein. 

The specific Dayhoff database designation names of sequences to which PRO1016 has sequence identity 
with include the following: S52645, PR59712, P R99249, P_R59713, BNAGPATRFl, CELT05H4J5 and 
25 CELZK40J. 

EXAMPLE 39:Isolation of cDNA Encoding Human PRO1013 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. The consensus DNA sequence was then extended using repeated cycles of BLAST and 

30 phrap to extend the consensus sequence as far as possible using the sources of EST sequences. 

In light of an observed sequence homology between the consensus sequence and an EST sequence 
encompassed within the Incyte EST clone no. 3107695, the Incyte EST clone 3107695 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 94 and is herein designated as DNA56410-1414. 

35 The entire nucleotide sequence of DNA56410-1414 is shown in Figure 94 (SEQ ID NO: 157). Clone 

DNA56410-1414 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 17-19 and ending at the stop codon at nucleotide positions 1244-1246 (Figure 94). The predicted 
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polypeptide precursor is 409 amino acids long (Figure 95). The full-length PRO 10 13 protein shown in Figure 
95 has an estimated molecular weight of about 46,662 daltons and a pi of about 7. 18. Clone DNA564 10-1414 
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 
the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Still analyzing the amino acid sequence of SEQ ID NO: 158, the putative signal peptide is at about amino 
5 acids 1-19 of SEQ ID NO: 158. N-glycosylation sites are at about amino acids 75-78 and 322-325 of SEQ ID 
NO: 158. An N-myristpylation site is at about amino acids 184-189 of SEQ ID NO: 158. A growth factor and 
cytokine receptor family domain is at about amino acids 134-149 of SEQ ID NO: 158. The corresponding 
nucleotides can be routinely determined given the sequences provided herein. 

Blast analysis showed some sequence identity with other proteins. Specifically, PRO1013 has some 
10 sequence identity with at least the Dayhoff sequences designated: D63877 1 ; MHU22019J , AE000730 JO, and 
AF019079J. 

EXAMPLE 40 : Isolation of cDNA Clones Encoding Human PRQ937 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
15 in Example 1 above. That consensus sequence is herein designated DNA49651. Based on the DNA49651 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PR0937. 

PCR primers (forward and reverse) were synthesized: 
20 forward PCR primer 5-CTCCGTGGTAAACCCCACAGCCC-3* (SEQ ID NO: 161); and 
reverse PCR primer 5 '-TC ACATCGATGGGATCC ATG ACCG-3 ' (SEQ ID NO: 162). 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA48651 sequence 
which had the following nucleotide sequence: 
hybridization probe 

25 5'-GGTCTCGTGACTGTGAAGCCATGTTACAACTACTGCTCAAACATCATGAG-3' (SEQ ID NO: 163). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0937 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal kidney tissue (LIB227). 

30 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PR0937 [herein designated as DNA56436-1448] (SEQ ID NO: 159) and the derived protein sequence for 
PR0937. 

The entire nucleotide sequence of DNA56436-1448 is shown in Figure 96 (SEQ ID NO: 159). It 
contains a single open reading frame having an apparent translational initiation site at nucleotide positions 499- 
35 501 and ending at the stop codon found at nucleotide positions 2167-2169 (Figure 96, SEQ ID NO: 159). The 
predicted polypeptide precursor is 556 amino acids long, has a calculated molecular weight of approximately 
62,412 daltons and an estimated pi of approximately 6.62. Analysis of the full-length PR0937 sequence shown 
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in Figure 97 (SEQ ID NO: 160) evidences the presence of the following features: signal peptide at about amino 
acids 1-22; ATP/GTP-binding site motif A (P-loop) at about amino acids 515-523; a potential N-glycosylaiion 
site at about amino acids 514-517; and sites of glypican homology at about amino acids 54-74, 106-156, 238- 
279, 309-345, 423-459, and 468-505. 

Clone DNA56436-1448 has been deposited with ATCC on May 27, 1998, and is assigned ATCC 
5 deposit no. 209902. 

Analysis of the amino acid sequence of the full-length PR0937 polypeptide suggests that it possesses 
significant sequence similarity to glypican proteins, thereby indicating that PR0937 may be a novel glypican 
protein. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced 
significant homology between the PR0937 amino acid sequence and the following Dayhoff sequences: 
10 GPCK_MOUSE, GPC2JRAT, GPC5JIUMAN, GPC3JIUMAN, P_R30168, CEC03H12_2, GEN13820, 
HS119E23J, HDAC_DROME, and AF0 17637J. 

EXAMPLE 41 : Isolation of cDNA clones Encoding Human PRQ842 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

15 Incyte EST cluster sequence designated herein as Incyte EST cluster sequence no. 69572. This EST cluster 
sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST 
databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, 
CA) to identify existing homologies. The homology search was performed using the computer program BLAST 
or BLAST2 (Altshul et ah, Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a 

20 BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA54230. 

In light of an observed sequence homology between the consensus sequence and an EST sequence 
encompassed within the Merck EST clone no. AA477092, the Merck EST clone AA477092 was purchased and 

25 the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 98 and is herein designated as DNA56855-1447. 

The full length clone shown in Figure 98 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 153-155 and ending at the stop codon found at nucleotide 
positions 510-512 (Figure 98; SEQ ID NO: 164). The predicted polypeptide precursor (Figure 99, SEQ ID 

30 NO: 165) is 1 19 amino acids long. PR0842 has a calculated molecular weight of approximately 13,819 Daltons 
and an estimated pi of approximately 11.16. Other features of PR0842 include a signal peptide at about amino 
acids 1-22, a potential protein kinase C phosphorylation site at about amino acids 39-41 and two potential N- 
myristoylation sites at about amino acids 27-32 and about amino acids 46-51. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 

35 alignment analysis of the full-length sequence shown in Figure 98 (SEQ ID NO: 164), evidenced some homology 
between the PR0842 amino acid sequence and the following Dayhoff sequences: CEZK13111, PR80843, 
RAT5HT2XJ, S81882J, A60912, MCU60315_137MC137L, U93422J, pj>91996, U93462J, and 
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ZN18_HUMAN. 

Clone DNA56855-1447 was deposited with the ATCC on June 23, 1998, and is assigned ATCC deposit 
no. 203004. 

EXAMPLE 42: Isolation of cDNA clones Encoding Human PRQ839 
5 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the Incyte LIFESEQ® database, designated Incyte EST Cluster No. 24479. This EST 
cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included 
public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ* Incyte 
Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using 
10 the computer program BLAST or BLAST2 (Altshul et al. , Methods in Enzvmolnpv 7<*-dfn-AH\ (1996)). Those 
comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known 
proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, 
University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein 
designated DNA55709. 

15 In light of an observed sequence homology between the DNA55709 consensus sequence and an EST 

sequence encompassed within the Merck EST clone no. 754525, the Merck EST clone 754525 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 100 and is herein designated as DNA56859-1445. 

The full length clone shown in Figure 100 contained a single open reading frame with an apparent 
20 translational initiation site at nucleotide positions 2-4 and ending at the stop codon found at nucleotide positions 
263-265 (Figure 100; SEQ ID NO: 166). The predicted polypeptide precursor (Figure 101, SEQ ID NO: 167). 
is 87 amino acids long. PR0839 has a calculated molecular weight of approximately 9,719 Daltons and an 
estimated pi of approximately 4.67. Other features of PR0839 include a signal peptide at about amino acids 1- 
23, potential protein kinase C phosphorylation sites at about amino acids 37-39 and about amino acids 85-87, 
25 a potential casein kinase II phosphorylation site at about amino acids 37-40, sequence identity with ribonucleotide 
reductase large subunit protein at about amino acids 50-60, and sequence identity with eukaryotic RNA-binding 
region RNP-1 proteins at about amino acids 70-79. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 101 (SEQ ID NO: 167), evidenced some 
30 homology between the PR0839 amino acid sequence and the following Dayhoff sequences: CD 14 MOUSE, 
XPR6_YARLI, HS714385_1, S49783, BB19_RABIT, GVPH-HALME, AB003135J, P_R85453, 
LUU27081J, and TP2BMOUSE. 

Clone DNA56859-1445 was deposited with the ATCC on June 23, 1998, and is assigned ATCC deposit 
no.209019. 

35 
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EXAMPLE 43 : Isolation of cDNA Clones Encoding Human PROl 180 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence (Incyte EST cluster sequence no. 14732). The Incyte EST cluster sequence no. 
14732 sequence was then compared to a variety of expressed sequence tag (EST) databases which included public 
EST databases (e.g. , GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo 
5 Alto, CA) to identify existing homologies. The homology search was performed using the computer program 
BLAST or BLAST2 (Altshul et ah, Methods in Enzvmology 266 :460-480(1996^. Those comparisons resulting 
in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA55711. 

10 In light of an observed sequence homology between the DNA5571 1 consensus sequence and an EST 

sequence encompassed within the Merck EST clone no. T60981, the Merck EST clone T60981 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 102 and is herein designated DNA56860-1510. 

The full length clone shown in Figure 102 contained a single open reading frame with an apparent 

15 translational initiation site at nucleotide positions 78-80 and ending at the stop codon found at nucleotide positions 
909-911 (Figure 102; SEQ ID NO: 168). The predicted polypeptide precursor is 277 amino acids long, has a 
calculated molecular weight of approximately 31,416 daltons and an estimated pi of approximately 8.88. 
Analysis of the full-length PROl 180 sequence shown in Figure 103 (SEQ ID NO: 169) evidences the presence 
of the following: a signal peptide from about amino acid 1 to about amino acid 23, a leucine zipper pattern 

20 sequence from about amino acid 10 to about amino acid 3 1 , and potential N-myristolation sited from about amino 
acid 64 to about amino acid 69, from about amino acid 78 to about amino acid 83, from about amino acid 80 
to about amino acid 85, from about amino acid 91 to about amino acid 96 and from about amino acid 201 to 
about amino acid 206. Clone DNA56860-1510 has been deposited with the ATCC on June 9, 1998 and is 
assigned ATCC deposit no. 209952. 

25 Analysis of the amino acid sequence of the full-length PROl 180 polypeptide suggests that it possesses 

sequence similarity to the methyltransferase family of proteins. More specifically, an analysis of the Dayhoff 
database (version 35.45 SwissProt 35) evidenced some degree of homology between the PRO 11 80 amino acid 
sequence and the following Dayhoff sequences, MTCI65J4, D69267, YH09_YEAST, BIOCSERMA, 
ATAC00448415T1D16.16, SHGCPIRJ8, SPBC3B9_4, AB009504J4, PJV17977 and A69952. 

30 

EXAMPLE 44 : Isolation of cDNA clones Encoding Human PROl 134 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 75 1 1 . This EST cluster sequence was then compared to 
a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and 
35 a proprietary EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
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in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA55725. Two proprietary Genentech EST 
sequences were employed in the assembly and are shown in Figure 106 (SEQ ID NO: 172) and Figure 107 (SEQ 
ID NO: 173). 

5 In light of an observed sequence homology between the DNA55725 consensus sequence and an EST 

sequence encompassed within the Merck EST clone no. H94897, the Merck EST clone H94897 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 106 and is herein designated as DNA56865-1491. 

Clone DNA56865-1491 contains a single open reading frame with an apparent translation^ initiation 

10 site at nucleotide positions 153-155 and ending at the stop codon at nucleotide positions 1266-1268 (Figure 104). 
The predicted polypeptide precursor is 371 amino acids long (Figure 105). The full-length PR01134 protein 
shown in Figure 105 has an estimated molecular weight of about 4 1 ,935 daltons and a pi of about 9.58. Analysis 
of the full-length PR01134 sequence shown in Figure 105 (SEQ ID NO:171) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 23, potential N-glycosylation sites from 

15 about amino acid 103 to about amino acid 106, from about amino acid 249 to about amino acid 252 and from 
about amino acid 257 to about amino acid 260, and an amino acid block having homology to tyrosinase CuA- 
binding region proteins from about amino acid 280 to about amino acid 306. Clone DNA56865-1491 has been 
deposited with ATCC on June 23, 1998 and is assigned ATCC deposit no. 203022. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 

20 alignment analysis of the full-length sequence shown in Figure 105 (SEQ ID NO: 171), evidenced significant 
homology between the PROH34 amino acid sequence and the following Dayhoff sequences: F20P518, 
AC002396J0, S47847, C64146, GSPA_BACSU, P_W10564, RFAIECOLI, Y258_HAEIN, RFAJ_SALTY 
and P_R32985. 

25 EXAMPLE 45 : Isolation of cDNA clones Encoding Human PRO830 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incytedatabase, designated 2025 1 . This EST cluster sequence was then compared to 
a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and 
a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 

30 homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al., Methods in E nzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA55733. 

35 In light of an observed sequence homology between the DNA55733 consensus sequence and an EST 

sequence encompassed within the Merck EST clone no. H78534, the Merck EST clone H78534 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
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The sequence of this cDNA insert is shown in Figure 108 and is herein designated as DNA56866-1342. 

Clone DNA56866-1342 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 154-156 and ending at the stop codon at nucleotide positions 415-417 (Figure 108). 
The predicted polypeptide precursor is 87 amino acids long (Figure 109) . The full-length PRO830 protein shown 
in Figure 109 has an estimated molecular weight of about 9,272 daltons and a pi of about 9. 19. Analysis of the 
5 full-length PRO830 sequence shown in Figure 109 (SEQ ID NO: 175) evidences the presence of the following: 
a signal peptide from about amino acid 1 to about amino acid 33, potential N-myristoylation sites from about 
amino acid 2 to about amino acid 7 and from about amino acid 8 to about amino acid 13 and a thioredoxin family 
of proteins homology block from about amino acid 23 to about amino acid 39. Clone UNQ470 (DNA56866- 
1342) has been deposited with ATCC on June 22, 1998 and is assigned ATCC deposit no. 203023. 
10 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 

alignment analysis of the full-length sequence shown in Figure 109 (SEQ ID NO: 175), evidenced significant 
homology between the PRO830 amino acid sequence and the following Dayhoff sequences: HSU88154J, 
HSU88153J, SAPKSGENEJ, HPU31791J, GGCNOT2J, CPU91421J, CHKESTPC09J , PQ0769, 
U97553J79 and B60095. 

15 

EXAMPLE 46 : Isolation of cDNA clones Encoding Human PROl 1 15 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the LIFESEQ® database, designated Incyte EST cluster sequence no. 165008. This EST 
cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included 

20 public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte 
Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using 
the computer program BLAST or BLAST2(Altshul etal., Methods in Enzvmology 266:460-480 (1996». Those 
comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known 
proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, 

25 University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein 
designated DNA55726. 

In light of an observed sequence homology between the DNA55726 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. R75784, the Merck EST clone R75784 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

30 The sequence of this cDNA insert is shown in Figure 1 1 1 and is herein designated as DNA56868-1478. 

The full length clone shown in Figure 110 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 189-191 and ending at the stop codon found at nucleotide 
positions 1524-1526 (Figure 110; SEQ ID NO: 176). The predicted polypeptide precursor (Figure 111, SEQ 
ID NO: 177) is 445 amino acids long. PROl 115 has a calculated molecular weight of approximately 50,533 

35 Daltons and an estimated pi of approximately 8.26. Additional features include a signal peptide at about amino 
acids 1-20; potential N-glycosylation sites at about amino acids 204-207, 295-298, and 313-316; and putative 
transmembrane domains at about amino acids 35-54, 75-97, 126-146, 185-204, 333-350, and 353-371. 
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An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 1 1 1 (SEQ ID NO: 177), evidenced some amino 
acid sequence identity between the PR01115 amino acid sequence and the following Dayhoff sequences: 
AF053947_79, S73698, CEC47A1CM, CCOMTNDS5GJ , HS4LMP2ACJ, LMP2_EBV, PA24J40USE, 
HCU33331J7, P-W05508, and AF002273J. 
5 Clone DNA56868-1478 was deposited with the ATCC on June 23, 1998 and is assigned ATCC deposit 

no. 203024.. 

EXAMPLE 47 : Isolation of cDNA clones Encoding Human PRO 1277 

A consensus DNA sequence was assembled relative to other ESTs using repeated cycles of BLAST and 
10 the program "phrap" as described in Example 1 above. One or more of the ESTs from the assembly was 
derived from diseased coronary artery tissue. The consensus sequence obtained is designated herein as 
tt DNA49434 w . 

In light of an observed sequence homology between the DNA49434 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3042605, the lncyte EST clone 3042605 was purchased 
15 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 1 12 (SEQ ID NO: 178). 

Clone DNA56869-1545 contains a single open reading frame with an apparent translation^ initiation 
site at nucleotide positions 188-190, and an apparent stop codon at nucleotide positions 2222-2224 (Figure 1 12). 
The predicted polypeptide precursor is 678 amino acids long (Figure 113). The full-length PR01277 protein 
20 shown in Figure 113 has an estimated molecular weight of about 73,930 daltons and a pi of about 9.48. 
Additional features include a signal peptide at about amino acids 1-26; a transmembrane domain at about amino 
acids 181-200, and potential N-glycosylation sites at about amino acids 390-393 and 520-523. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 113 (SEQ ID NO: 179), revealed significant 
25 homology between the PR01277 amino acid sequence and Dayhoff sequence no AF012252 1 . Homology was 
also found between the PR01277 amino acid sequence and the following Dayhoff sequences: AF006740 1, 
CA36_HUMAN, HSU1J, HUMCOL7A1X1 , CA17JHUMAN, MMZ78163J, CAMA_CHICK, 
HSU69263J, YNX3 CAEEL, and MMRNAM31 . 

Clone DNA56869-1545 has been deposited with ATCC and is assigned ATCC deposit no. 203161. 

30 

EXAMPLE 48: Isolation of c DNA Clones Encoding Human PROH35 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA52767. Based on the DNA52767 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
35 the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PR01135. 
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In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with PCR primer pairs prepared based upon the DNA52767 sequence. A positive 
library was then used to isolate clones encoding the PROl 135 gene using the probe oligonucleotide and one of 
the PCR primers. RNA for construction of the cDNA libraries was isolated from human coronary artery smooth 
muscle tissue (LIB309). The cDNA libraries used to isolate the cDNA clones were constructed by standard 
5 methods using commercially available reagents such as those from Invitrogen, San Diego, CA. The cDNA was 
primed with oligo dT containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, 
sized appropriately by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such 
as pRKB or pRKD; pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al., 
Science . 253:1278-1280 (1991)) in the unique Xhol and NotI sites. 
10 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PROl 135 [herein designated as DNA56870-1492] (SEQ ID NO: 180) and the derived protein sequence for 
PROl 135. 

The entire nucleotide sequence of DNA56870-1492 is shown in Figure 1 14 (SEQ ID NO: 180). Clone 
DNA56870-1492 contains a single open reading frame with an apparent translational initiation site at nucleotide 

15 positions 62-64 and ending at the stop codon at nucleotide positions 1685-1687 (Figure 114). The predicted 
polypeptide precursor is 541 amino acids long (Figure 1 15). The full-length PROl 135 protein shown in Figure 
115 has an estimated molecular weight of about 60,335 daltons and a pi of about 5.26. Analysis of the full- 
length PROl 135 sequence shown in Figure 1 15 (SEQ ID NO: 181) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about aino acid 21, potential N-glycosylation sited from about amino 

20 acid 53 to about amino acid 56, from about amino acid 75 to about amino acid 78, from about amino acid 252 
to about amino acid 255 and from about amino acid 413 to about amino acid 4 16 and an amino acid block having 
homology to glycosyl hydrolase family 35 proteins from about amino acid 399 to about amino acid 414. Clone 
DNA56870-1492 has been deposited with ATCC on June 2, 1998 and is assigned ATCC deposit no. 209925. 
Analysis of the amino acid sequence of the full-length PROl 135 polypeptide suggests that it possesses 

25 significant sequence similarity to the alpha 1 ,2-mannosidase protein, thereby indicating that PROl 135 may be 
a novel marmosidase. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
evidenced significant homology between the PROl 135 amino acid sequence and the following Dayhoff 
sequences, DMC86E4_5, D86967 1, SPAC23A1_4,YH(M_YEAST,B54408,SSMAN9MANJ, CEZC410_4, 
S61631 andMSU14190_l. 

30 

EXAMPLE 49 : Isolation of cDNA Clones Encoding Human PROl 1 14 

A cDN A sequence isolated in the amylase screen described in Example 2 above was found, by the WU- 
BLAST-2 sequence alignment computer program, to have certain sequence identity to other known interferon 
receptors. This cDNA sequence is herein designated DNA48466 and is shown in Figure 1 18 (SEQ ID NO: 184). 
35 Based on the sequence identity, probes were generated from the sequence of the DNA48466 molecule and used 
to screen a human breast carconoma library (LIB 1 35) prepared as described in paragraph 1 of Example 2 above . 
The cloning vector was pRK5B (pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes 
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et al., Science . 253:1278-1280 (1991)), and the cDNA size cut was less than 2800 bp. 

The oligonucleotide probes employed were as follows: 
forward PCR primer 5 ' -AGGCTTCGCTGCGACTAG ACCTC-3 ' (SEQ ID NO: 185) 
reverse PCR primer 5 ' -CC AGGTCGGGTAAGGATGGTTG AG-3 ' (SEQ ID NO: 186) 
hybridization probe 

5 5^TTTCTACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGC-3' (SEQ ID NO: 187) 
A full length clone was identified that contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 250-252, and a stop signal at nucleotide positions 1183-1185 
(Figure 116, SEQ ID NO: 182). The predicted polypeptide precursor is 31 1 amino acids long, has a calculated 
molecular weight of approximately 35,076 daltons and an estimated pi of approximately 5.04. Analysis of the 

10 full-length PROl 1 14 interferon receptor sequence shown in Figure 1 17 (SEQ ID NO: 183) evidences the presence 
of the following: a signal peptide from about amino acid 1 to about amino acid 29, a transmembrane domain 
from about amino acid 230 to about amino acid 255, potential N-glycosylation sites from about amino acid 40 
to about amino acid 43 and from about amino acid 134 to about amino acid 137, an amino acid sequence block 
having homology to tissue factor proteins from about amino acid 92 to about amino acid 1 19 and an amino acid 

15 sequence block having homology to integrin alpha chain proteins from about amino acid 232 to about amino acid 
262. Clone DNA57033-1403 has been deposited with ATCC on May 27, 1998 and is assigned ATCC deposit 
no. 209905. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 1 17 (SEQ ID NO: 183), evidenced significant 
20 homology between the PROl 1 14 interferon receptor amino acid sequence and the following Dayhoff sequences: 
G01418, INRl_MOUSE, P_R71035, INGSJiUMAN, A26595J, A26593J, 156215 and TFHUMAN. 

EXAMPLE 50 : Isolation of cDNA Clones Encoding Human PRQ828 

A consensus DNA sequence was identified using the method described in Example 1 above. This 
25 consensus sequence is herein designated DNA35717. Based on the DNA35717 consensus sequence, 

oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, 

and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0828. 
PCR primers (forward and reverse) were synthesized: 

forward PCR primer 5 ' -GCAGGACTTCTACG ACTTC A AGOC-^ ' (SEQ ID NO: 190); and 
30 reverse PCR primer 5-AGTCTGGGCCAGGTACTTGAAnnr-T (SEQ ID NO: 191). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA35717 

sequence which had the following nucleotide sequence: 

hybridization probe 

5 ' -C AAC ATCCGGGGC AAACTGGTGTCGCTGG AGA AGT ACCGCGG ATCGGTGT-3 ' (SEQ ID NO: 192) 
35 In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0828 gene using the probe oligonucleotide and one of the PCR primers. RNA 
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for construction of the cDNA libraries was isolated from human fetal lung tissue (LIB25). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR0828 [herein designated as DNA57037-1444] (SEQ ID NO: 188) and the derived protein sequence for 
PR0828. 

The entire nucleotide sequence of DNA57037-1444 is shown in Figure 1 19 (SEQ ID NO: 188). Clone 
5 DNA57037-1444 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 34-36 and ending at the stop codon at nucleotide positions 595-597 (Figure 119). The predicted 
polypeptide precursor is 187 amino acids long (Figure 120). The full-length PR0828 protein shown in Figure 
120 has an estimated molecular weight of about 20,996 daltons and a pi of about 8.62. Analysis of the full- 
length PR0828 sequence shown in Figure 120 (SEQ ID NO: 189) evidences the presence of the following: a 

10 signal peptide at about amino acids 1- 21; sequences identity to glutathione peroxidases signature 2 at about 
amino acids 82-89; sequence identity to glutathione peroxidases selenocysteine proteins at about amino acids 35- 
60, 63-100, 107-134, and 138-159. Clone DNA57037-1444 has been deposited with ATCC on May 27, 1998, 
and is assigned ATCC deposit no. 209903. 

Analysis of the amino acid sequence of the full-length PR0828 polypeptide suggests that it possesses 

15 significant sequence similarity to glutathione peroxidases, thereby indicating that PR0828 may be a novel 
peroxidase enzyme. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
evidenced sequence identity between the PR0828 amino acid sequence and the following Dayhoff sequences: 
AF053311J, CELT09A12_2, AC004151J, BTUE_ECOLI, CER05H10 3, PJ>80918, PWU88907J, and 
PW22308. 

20 

EXAMPLE 51 : Isolation of cDNA clones Encoding Human PRO10Q9 

A cDNA clone (DNA57129-1413) encoding a native human PRO1009 polypeptide was identified by 

the use of a yeast screen, in a human SK-Lu-1 adenocarcinoma cell line cDNA library that preferentially 

represents the 5' ends of the primary cDNA clones. First SEQ ID NO: 195 (Figure 123) was identified, which 
25 was extended by alignments to other EST sequences to form a consensus sequence. Oligonucleotide probes 

based upon the consensus sequence were synthesized and used to screen the cDNA library which gave rise to 

the full-length DNA57129-1413 clone. 

The full length DN A57 1 29- 1 4 1 3 clone shown in Figure 1 2 1 contained a single open reading frame with 

an apparent translation^ initiation site at nucleotide positions 41-43 and ending at the stop codon found at 
30 nucleotide positions 1886-1888 (Figure 121; SEQ ID NO: 193). The predicted polypeptide precursor (Figure 

122, SEQ ID NO: 194) is 615 amino acids long. Figure 122 also shows the approximate locations of the signal 

sequence, transmembrane domains, myristoylation sites, a giycosylation site and an AMP-binding domain. 

PRO1009 has a calculated molecular weight of approximately 68,125 daltons and an estimated pi of 

approximately 7.82. Clone DNA57129-1413 has been deposited with ATCC and is assigned ATCC deposit no. 
35 209977. It is understood that the deposited clone has the actual and correct sequence and that the representations 

herein may have minor, normal sequencing errors. 



419 



WO 99/63088 



PCT/US99/12252 



Based on a WU-BLAST-2 sequence alignment analysis (using the ALIGN computer program) of the 
full-length sequence, PRO1009 shows amino acid sequence identity to at least the following proteins which were 
designated in a Dayhoff database as follows: F69893, CEF28F8J2, BSY13917_7, BSY13917J7, D69187, 
D69649, XCRPFBJ, E64928, YDID_ECOLI, BNACSF8J and RPU75363_2. 

5 EXAMPLE 52 : Isolation of cDNA Clones Encoding Human PRO 1007 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated as DNA40671. 

In light of an observed sequence homology between the DNA40671 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. T70513, the Merck EST clone T70513 was purchased 
10 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 124. 

The entire nucleotide sequence of DNA57690-1374 is shown in Figure 124 (SEQ ID NO:196). Clone 
DNA57690-1374 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 16-18 and ending at the stop codon at nucleotide positions 1054-1056 (Figure 124). The predicted 
15 polypeptide precursor is 346 amino acids long (Figure 125). The full-length PRO1007 protein shown in Figure 
125 has an estimated molecular weight of about 35,971 daltons and a pi of about 8.17. Clone DNA57690-1374 
has been deposited with the ATCC on June 9, 1998. Regarding the sequence, it is understood that the deposited 
clone contains the actual sequence, and the sequences provided herein are based on known sequencing 
techniques. The representative figures herein show the representative numbering. 
20 Analysis of the amino acid sequence of the full-length PRO 1007 polypeptide suggests that portions of 

it possess sequence identity to MAGPIAP, thereby indicating that PRO 1007 may be a novel member of the 
family to which MAGPIAP belongs. . 

Still analyzing the amino acid sequence of SEQ ID NO: 197, the putative signal peptide is at about amino 
acids 1-30 of SEQ ID NO: 197. The transmembrane domain is at amino acids 325-346 of SEQ ID NO: 197. N- 
25 glycosylate sites are at about amino acids 118-121, 129-132, 163-166, 176-179, 183-186 and 227-130 of SEQ 
ID NO: 197. Ly-6/u-Par domain protein homology is at about amino acids 17-36 and 209-222 of SEQ ID 
NO: 197. The corresponding nucleotides of the amino acids presented herein can be routinely determined given 
the sequences provided herein. 

30 EXAMPLE 53 : Isolation of cDNA clones Encoding Human PRO 1056 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the lncyte database, designated herein as 6425. This EST cluster sequence was then 
compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., 
GenBank) and a proprietary EST DNA database (Lifeseq®, lncyte Pharmaceuticals, Palo Alto, CA) to identity 

35 existing homologies. The homology search was performed using the computer program BLAST or BLAST2 
(Altshul et al., Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score 
of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a 
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consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, 
Washington). The consensus sequence obtained therefrom is herein designated DNA55736. 

In light of an observed sequence homology between the DNA55736 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. R88049, the Merck EST clone R88049 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
5 The sequence of this cDNA insert is shown in Figure 126 and is herein designated as DNA57693-1424. 

Clone DNA57693-1424 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 56-58 and ending at the stop codon at nucleotide positions 416-418 (Figure 126). The 
predicted polypeptide precursor is 120 amino acids long (Figure 127). The full-length PRO1056 protein shown 
in Figure 127 has an estimated molecular weight of about 13,345 daltons and a pi of about 5.18. Analysis of 

10 the full-length PRO1056 sequence shown in Figure 127 (SEQ ID NO: 199) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 18, a transmembrane domain from about 
amino acid 39 to about amino acid 58, a potential N-glycosylation site from about amino acid 86 to about amino 
acid 89, protein kinase C phosphorylation sites from about amino acid 36 to about amino acid 38 and from about 
amino acid 58 to about amino acid 60, a tyrosine kinase phosphorylation site from about amino acid 25 to about 

15 amino acid 32 and an amino acid sequence block having homology to channel forming colicin proteins from 
about amino acid 24 to about amino acid 56. Clone DNA57693-1424 has been deposited with ATCC on June 
23, 1998 and is assigned ATCC deposit no. 203008. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 127 (SEQ ID NO: 199), evidenced significant 

20 homology between the PRO1056 amino acid sequence and the following Dayhoff sequences: PLM HUMAN, 
A40533, ATNGHUM AN , A55571, ATNGSHEEP, S31524, GEN 13025, RIC_MOUSE, A48678 and 
A10871_l. 

EXAMPLE 54: Isolation of cDNA clones Encoding Human PRQ826 

25 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the Incyte database, designated 47283. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 

30 et al., Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA56000. 

In light of an observed sequence homology between the DNA56000 consensus sequence and an EST 

35 sequence encompassed within the Merck EST clone no. W69233, the Merck EST clone W69233 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 128 and is herein designated as DNA57694-1341. 
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Clone DNA57694-1341 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 13-15 and ending at the stop codon at nucleotide positions 310-312 (Figure 128). The 
predicted polypeptide precursor is 99 amino acids long (Figure 129). The full-length PR0826 protein shown 
in Figure 129 has an estimated molecular weight of about 1 1,050 daltons and a pi of about 7.47. Analysis of 
the full-length PR0826 sequence shown in Figure 129 (SEQ ID NO:201) evidences the presence of the 
5 following: a signal peptide from about amino acid 1 to about amino acid 22, potential N-myristoylation sites from 
about amino acid 22 to about amino acid 27 and from about amino acid 90 to about amino acid 95 and an amino 
acid sequence block having homology to peroxidase from about amino acid 16 to about amino acid 48. Clone 
DNA57694-1341 has been deposited with ATCC on June 22, 1998 and is assigned ATCC deposit no. 203017. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
10 alignment analysis of the full-length sequence shown in Figure 129 (SEQ ID NO:201), evidenced significant 
homology between the PR0826 amino acid sequence and the following Dayhoff sequences: CCU12315JI, 
SCU96108_6, CELF39F10_4 and HELT HELHO. 

EXAMPLE 55 : Isolation of cDNA clones Encoding Human PRQ819 

15 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the Incyte database, designated 49605. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 

20 et al., Methods in Enzvmolopv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA56015. 

In light of an observed sequence homology between the DNA56015 consensus sequence and an EST 

25 sequence encompassed within the Merck EST clone no. H65785, the Merck EST clone H65785 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 130 and is herein designated as DNA57695-1340. 

Clone DNA57695-1340 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 46-48 and ending at the stop codon at nucleotide positions 202-204 (Figure 130). The 

30 predicted polypeptide precursor is 52 amino acids long (Figure 131). The full-length PR0819 protein shown 
in Figure 131 has an estimated molecular weight of about 5,216 daltons and a pi of about 4.67. Analysis of the 
full-length PR0819 sequence shown in Figure 131 (SEQ ID NO:203) evidences the presence of the following: 
a signal peptide from about amino acid 1 to about amino acid 24, a potential N-myristoylation site from about 
amino acid 2 to about amino acid 7 and a region having homology to immunoglobulin light chain from about 

35 amino acid 5 to about amino acid 33. Clone DNA57695-1340 has been deposited with ATCC on June 23, 1998 
and is assigned ATCC deposit no. 203006. 



422 



WO 99/63088 



PCT/US99/12252 



An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 131 (SEQ ID NO:203), evidenced significant 
homology between the PR0819 amino acid sequence and the following Dayhoff sequences: HSU03899 1, 
HUMIGLITEBJ, VG28_HSVSA, AF031522J, PAD1YEAST and AF045484J. 

5 EXAMPLE 56 : Isolation of cDNA Clones Encoding Human PRO 1006 

An initial candidate sequence from Incyte cluster sequence no. 45748 was identified using the signal 
algorithm process described in Example 3 above. This sequence was then aligned with a variety of public and 
Incyte EST sequences and a consensus sequence designated herein as DNA56036 was derived therefrom. 

In light of an observed sequence homology between the DNA56036 consensus sequence and an EST 
10 sequence encompassed within the Merck EST clone no. 489737, the Merck EST clone 489737 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 132. 

The entire nucleotide sequence of DNA57699-1412 is shown in Figure 132 (SEQ ID NO:204). Clone 
DNA57699-1412 contains a single open reading frame with an apparent translational initiation site at nucleotide 
15 positions 28-30 and ending at the stop codon at nucleotide positions 1204-1206 (Figure 132). The predicted 
polypeptide precursor is 392 amino acids long (Figure 133). The full-length PRO1006 protein shown in Figure 
133 has an estimated molecular weight of about 46, 189 daltons and a pi of about 9.04. Clone DNA57699-1412 
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 
the correct sequence, and the sequences provided herein are based on known sequencing techniques. 
20 Analyzing the amino acid sequence of SEQ ID NO: 205, the putative signal peptide is at about amino 

acids 1-23 of SEQ ID NO:205. The N-glycosylation sites are at about amino acids 40-43, 53-56, 204-207 and 
373-376 of SEQ ID NO:205. An N-myristoylation site is at about amino acids 273-278 of SEQ ID NO:205. 

The corresponding nucleotides of these amino acid regions and others can be routinely determined given the 
sequences provided herein. 

25 

EXAMPLE 57 : Isolation of cDNA Clones Encoding Human PRQ1112 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a specific 
EST cluster sequence. This EST cluster sequence was then compared to a variety of expressed sequence tag 
(EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database 

30 (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was 
performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmology 266:460- 
480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not 
encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" 
(Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is 

35 herein designated DNA56018. 

In light of an observed sequence homology between the DNA56018 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. AA223546, the Merck EST clone AA223546 was 
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purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length 
protein. The sequence of this cDNA insert is shown in Figure 134 and is herein designated as DNA57702-1476. 

The entire nucleotide sequence of DNA57702-1476 is shown in Figure 134 (SEQ ID NO:206). Clone 
DNA57702-1476 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 20-22 and ending at the stop codon at nucleotide positions 806-808 of SEQ ID NO:206 (Figure 134). 
5 The predicted polypeptide precursor is 262 amino acids long (Figure 135). The full-length PR01112 protein 
shown in Figure 135 has an estimated molecular weight of about 29,379 daltons and a pi of about 8.93. Figure 
135 also shows the approximate locations of the signal peptide and transmembrane domains. Clone DNA57702- 
1476 has been deposited with the ATCC on June 9, 1998. It is understood that the deposited clone has the actual 
nucleic acid sequence and that the sequences provided herein are based on known sequencing techniques. 

10 Analysis of the amino acid sequence of the full-length PROl 1 12 polypeptide suggests that it possesses 

some sequence similarity to other proteins. More specifically, an analysis of the Dayhoff database (version 
35.45 SwissProt 35) evidenced some sequence identity between the PROl 1 12 amino acid sequence and at least 
the following Dayhoff sequences, MTY20B1113 (a mycobacterium tuberculosis peptide), F64471, 
AE000690_6, XLU16364J, E43259 (H+ -transporting ATP synthase) and PIGSLADRXEl (MHC class II 

15 histocompatibility antigen). 

EXAMPLE 58: Isolation of cDNA clones Encoding Human PRO 1074 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence (Incyte cluster sequence No. 42586). This cluster sequence was then compared to 

20 a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and 
a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al., Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 

25 DNA sequence with the program "phrap" (Phil Green, Univ. of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA56251. 

In light of an observed sequence homology between the DNA56251 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. AA081912, the Merck EST clone AA081912 was 
purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length 

30 protein. The sequence of this cDNA insert is shown in Figure 136 and is the full-length DNA sequence for 
PRO1074. Clone DNA57704-1452 was deposited with the ATCC on June 9, 1998, and is assigned ATCC 
deposit no. 209953. 

The entire nucleotide sequence of DNA57704-1452 is shown in Figure 136 (SEQ ID NO:208). Clone 
DNA57704-1452 contains a single open reading frame with an apparent translational initiation site at nucleotide 
35 positions 322-324 and ending at the stop codon at nucleotide positions 1315-1317 (Figure 136). The predicted 
polypeptide precursor is 331 amino acids long (Figure 137). The full-length PRO1074 protein shown in Figure 
137 has an estimated molecular weight of about 39,512 Daltons and a pi of about 8.03. Analysis of the full- 
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length PRO1074 sequence shown in Figure 137 (SEQ ID NO:209) evidences the presence of the following 
features: a transmembrane domain at about amino acids 20 to 39; potential N-glycosylation sites at about amino 
acids 72 to 75, 154 to 157, 198 to 201, 212 to 215, and 326 to 329; a glycosaminoglycan attachment site at about 
amino acids 239 to 242, and a Ly-6/u-PAR domain at about amino acids 23 to 36. 

Analysis of the amino acid sequence of the full-length PRO 1074 polypeptide suggests that it possesses 
5 significant sequence similarity to beta 1 ,3-galactosyltransf erase, thereby indicating that PRO 1074 may be a novel 
member of the galactosyltransferase family of proteins. Analysis of the amino acid sequence of the full-length 
PRO1074 polypeptide using the Dayhoff database (version 35.45 SwissProt 35) evidenced homology between 
the PRO 1074 amino acid sequence and the following Dayhoff sequences: AF029792J, P R57433, 
DMU41449J, AC000348J4, P_R47479, CET09F5_2, CEF14B6_4, CET15D6_5, CEC54C8_4, and 
10 CEE03H4J0. 

Clone DNA57704-1452 was deposited with the ATCC on June 9, 1998, and is assigned ATCC deposit 
no. 209953. 

EXAMPLE 59 : Isolation of cDNA clones Encoding Human PRO 1005 

15 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the LIFESEQ® database, Incyte cluster sequence no. 49243. This EST cluster sequence 
was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to 
identify existing homologies. The homology search was performed using the computer program BLAST or 

20 BLAST2 (Altshul et al., Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56380. 

In light of an observed sequence homology between the DNA56380 consensus sequence and an EST 

25 sequence encompassed within the Merck EST clone no. AA256657, the Merck EST clone AA256657 was 
purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length 
protein. The sequence of this cDNA insert is shown in Figure 138 and is herein designated as DNA57708-141 1 . 

The full length clone shown in Figure 138 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 30-32 and ending at the stop codon found at nucleotide positions 

30 585-587 (Figure 138; SEQ ID NO:210). The predicted polypeptide precursor (Figure 139, SEQ ID NO:21 1) 
is 185 amino acids long. PRO1005 has a calculated molecular weight of approximately 20,331 daltons and an 
estimated pi of approximately 5.85. Clone DNA57708-141 1 was deposited with the ATCC June 23, 1998, and 
is assigned ATCC deposit no. 203021 . 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

35 alignment analysis of the full-length sequence shown in Figure 139 (SEQ ID NO:211), evidenced some 
homology between the PRO 1005 amino acid sequence and the following Dayhoff sequences: DDU07187 1, 
DDU87912J, CELD1007J4, A42239, DDU42597J, CYAGJDICDI, S50452, MRKCKLEPN, P-R41998, 
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and XYNA_RUMFL. 

EXAMPLE 60 : Isolation of cDNA clones Encoding Human PRO1073 

An initial DNA sequence referred to herein as DNA55938 and shown in Figure 142 (SEQ ID NO:214) 
was identified using a yeast screen, in a human SK-Lu-1 adenocarcinoma cell line cDNA library that 
5 preferentially represents the 5 ' ends of the primary cDN A clones. DNA55938 was then compared to ESTs from 
public databases (e.g., GenBank), and a proprietary EST database (LIFESEQ®, Incyte Pharmaceuticals, Palo 
Alto, CA), using the computer program BLAST or BLAST2 [Altschul et al. , Methods in Enzvrnology. 266:460- 
480 (1996)]. The ESTs were clustered and assembled into a consensus DNA sequence using the computer 
program M phrap n (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 

10 obtained is designated herein as DNA5641 1 . 

In light of an observed sequence homology between the DNA56411 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. H86027, the Merck EST clone H86027 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 140. 

15 The full length DNA57710-1451 clone shown in Figure 140 contained a single open reading frame with 

an apparent translational initiation site at nucleotide positions 345-347 and ending at the stop codon found at 
nucleotide positions 1242-1244 (Figure 140; SEQ ID NO:212). The predicted polypeptide precursor (Figure 
141, SEQ ID NO:213) is 299 amino acids long. PRO1073 has a calculated molecular weight of approximately 
34,689 daltons and an estimated pi of approximately 11.49. The PRO 1073 polypeptide has the following 

20 additional features: a signal peptide at about amino acids 1-31, sequence identity to bZIP transcription factor 
basic domain signature at about amino acids, a potential N-glycosylation site at about amino acids 2-5, and 
sequence identity with protamine PI proteins at about amino acids 158-183. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 141 (SEQ ID NO:2 13), revealed some sequence 

25 identity between the PRO 1073 amino acid sequence and the following Dayhoff sequences: MMU37351 1, 
ATAC00250510T9J22.10, S59043, ENXNUPRJ, B47328, SR55_DROME, S26650, SONHUMAN, 
VIT2 CHICK, and XLC4SRPRTJ. 

Clone DNA57710-1451 was deposited with the ATCC on July 1, 1998 and is assigned ATCC deposit 
no. 203048. 

30 

EXAMPLE 61 isolation of cDNA clones Encoding Human PROl 152 

A cDNA clone (DNA577 11-1501) encoding a native human PROl 152 polypeptide was identified by 
employing a yeast screen, in a human infant brain cDNA library that preferentially represents the 5' ends of the 
primary cDNA clones. Specifically, a yeast screen was employed to identify a cDNA designated herein as 
35 DNA55807 (SEQ ID NO:217; see Figure 145). 

In light of an observed sequence homology between the DNA55807 sequence and an EST sequence 
encompassed within the Merck EST clone no. R56756, the Merck EST clone R56756 was purchased and the 
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cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 143. 

The full-length DNA5771 1-1501 clone shown in Figure 143 contains a single open reading frame with 
an apparent translationai initiation site at nucleotide positions 58-60 and ending at the stop codon at nucleotide 
positions 1495-1497 (Figure 143). The predicted polypeptide precursor is 479 amino acids long (Figure 144). 
5 The full-length PROl 152 protein shown in Figure 144 has an estimated molecular weight of about 53,602 daltons 
and a pi of about 8.82. Analysis of the full-length PROl 152 sequence shown in Figure 144 (SEQ ID NO:216) 
evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 28, 
transmembrane domains from about amino acid 133 to about amino acid 155, from about amino acid 168 to 
about amino acid 187, from about amino acid 229 to about amino acid 247, from about amino acid 264 to about 

10 amino acid 285, from about amino acid 309 to about amino acid 330, from about amino acid 371 to about amino 
acid 390 and from about amino acid 441 to about amino acid 464, potential N-glycosylation sites from about 
amino acid 34 to about amino acid 37 and from about amino acid 387 to about amino acid 390 and an amino acid 
sequence block having homology to a respiratory-chain NADH dehydrogenase subunit from about amino acid 
243 to about amino acid 287. Clone DNA5771 1-1501 has been deposited with ATCC on July 1, 1998 and is 

15 assigned ATCC deposit no. 203047. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 144 (SEQ ID NO:216), evidenced significant 
homology between the PROl 152 amino acid sequence and the following Dayhoff sequences: AF052239 1, 
SYNN9CGAJ, SFCYTB2J, GEN12507, PJU1769, MTV025J09, C61168, S43171, P P61689 and 

20 P_P61696. 

EXAMPLE 62: Isolation of cDNA clones Encoding Human PROl 136 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 109142. This EST cluster sequence was then compared 

25 to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al., Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 

30 DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA56039. 

In light of an observed sequence homology between the DNA56039 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. HSC1NF011, the Merck EST clone HSC1NF011 was 
purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length 

35 protein. The sequence of this cDNA insert is shown in Figure 146 and is herein designated as DNA57827-1493. 

Clone DNA57827-1493) contains a single open reading frame with an apparent translationai initiation 
site at nucleotide positions 216-218 and ending at the stop codon at nucleotide positions 21 12-2114 (Figure 146). 
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The predicted polypeptide precursor is 632 amino acids long (Figure 147). The full-length PROl 136 protein 
shown in Figure 147 has an estimated molecular weight of about 69,643 daltons and a pi of about 8.5. Analysis 
of the full-length PROl 136 sequence shown in Figure 147 (SEQ ID NO:219) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 15 and potential N-glycosylation sites 
from about amino acid 108 to about amino acid 11, from about amino acid 157 to about amino acid 160, from 
5 about amino acid 289 to about amino acid 292 and from about amino acid 384 to about amino acid 387. Clone 
DNA57827-1493 has been deposited with ATCC on July 1, 1998 and is assigned ATCC deposit no. 203045. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 147 (SEQ ID NO:219), evidenced significant 
homology between the PROl 136 amino acid sequence and the following Dayhoff sequences: AF034746 1, 
10 AF034745J, MMAF000168J9, HSMUPP1J, AF060539J, SP97RAT, 138757, MMU93309J, 
CEK01A6_4 and HSA224747 1. 

EXAMPLE 63 : Isolation of cDNA clones Encoding Human PRQ813 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

15 Incyte EST cluster sequence (Incyte EST cluster sequence no. 45501 . The Incyte EST cluster sequence no. 
45501 sequence was then compared to a variety of expressed sequence tag (EST) databases which included public 
EST databases (e.g. , GenBank) and a proprietary EST DNA database (LIFESEQ ™, Incyte Pharmaceuticals, Palo 
Alto, CA) to identify existing homologies. The homology search was performed using the computer program 
BLAST or BLAST2 (Altshul et al. , Methods in Enzvmology 266:460-480 ( 1 996)). Those comparisons resulting 

20 in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program u phrap M (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56400. 

In light of an observed sequence homology between the DNA56400 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. T90592, the Merck EST clone T90592 was purchased 

25 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 148 and is herein designated DNA57834-1339. 

The full length clone shown in Figure 148 contained a single open reading frame with an apparent 
translation^ initiation site at nucleotide positions 109-111 and ending at the stop codon found at nucleotide 
positions 637-639 (Figure 149; SEQ ID NO:221). The predicted polypeptide precursor is 176 amino acids long, 

30 has a calculated molecular weight of approximately 19,616 daltons and an estimated pi of approximately 7.11. 
Analysis of the full-length PR0813 sequence shown in Figure 149 (SEQ ID NO:221) evidences the presence of 
the following: a signal peptide from about amino acid 1 to about amino acid 26 and potential N-myristoylation 
sites from about amino acid 48 to about amino acid 53, from about amino acid 153 to about amino acid 158, 
from about amino acid 156 to about amino acid 161 and from about amino acid 167 to about amino acid 172. 

35 Clone DNA57834-1339 has been deposited with the ATCC on June 9, 1998 and is assigned ATCC deposit no. 
209954. 
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Analysis of the amino acid sequence of the full-length PR0813 polypeptide suggests that it possesses 
sequence similarity to the pulmonary surfactant-associated protein C. More specifically, an analysis of the 
Dayhoff database (version 35.45 SwissProt 35) evidenced some degree of homology between the PR0813 amino 
acid sequence and the following Dayhoff sequences, PSPC_MUSVI, P_P92071, G02964, PR65489, P_P82977, 
P_R84555, S55542, MUSIGHAJJ and PHI 158. 

5 

EXAMPLE 64 : Isolation of cDNA Clones Encoding Human PRQ809 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence. The Incyte EST cluster sequence was then compared to a variety of expressed 
sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary EST DNA 

10 database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology 
search was performed using the computer program BLAST or BLAST2 ( Altshul et al . , Methods in Enzvmology 
266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater 
that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the 
program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 

15 obtained therefrom is herein designated DNA56418. 

In light of an observed sequence homology between the DNA56418 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. H74302, the Merck EST clone H74302 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 150 and is herein designated DNA57836-1338. 

20 The entire nucleotide sequence of DNA57836-1338 is shown in Figure 150 (SEQ ID NO:222). Clone 

DNA57836-1338 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 63-65 and ending at the stop codon at nucleotide positions 858-860 of SEQ ID NO:222 (Figure 150). 
The predicted polypeptide precursor is 265 amino acids long (Figure 151). The full-length PRO809 protein 
shown in Figure 151 has an estimated molecular weight of about 29,061 daltons and a pi of about 9. 18. Figure 

25 151 further shows the approximate positions of the signal peptide and N-glysosylation sites. The corresponding 
nucleotides can be determined by referencing Figure 150. Clone DNA57836-1338 has been deposited with 
ATCC on June 23, 1998. It is understood that the deposited clone has the actual nucleic acid sequence and that 
the sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PRO809 polypeptide suggests that it possesses 

30 some sequence similarity to the heparin sulfate proteoglycan and to endothelial cell adhesion molecule- 1 . More 
specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced sequence identity 
between the PRO809 amino acid sequence and the following Dayhoff sequences, PGBMMOUSE, D82082_l 
and PW14158. 

35 EXAMPLE 65: Isolation of cDNA Clones Encoding Human PRQ791 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence. The Incyte EST cluster sequence was then compared to a variety of expressed 
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sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary EST DN A 
database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology 
search was performed using the computer program BLAST or BLAST2 (Altshul et al. , Methods in Enzvmology 
266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater 
that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the 
5 program tt phrap w (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56429. 

In light of an observed sequence homology between the DNA56429 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. 36367, the Merck EST clone 36367 was purchased and 
the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 

10 sequence of this cDNA insert is shown in Figure 152 and is herein designated DNA57838-1337. 

The entire nucleotide sequence of DNA57838-1337 is shown in Figure 152 (SEQ ID NO:224). Clone 
DNA57838-1337 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 9-11 and ending at the stop codon at nucleotide positions 747-749 of SEQ ID NO:224 (Figure 152). 
The predicted polypeptide precursor is 246 amino acids long (Figure 153). The full-length PR0791 protein 

15 shown in Figure 153 has an estimated molecular weight of about 27 ,368 daltons and a pi of about 7.45. Figure 
153 also shows the approximate locations of the signal peptide, the transmembrane domain, N-glycosylation 
sites and a region conserved in extracellular proteins. The corresponding nucleotides of one embodiment 
provided herein can be identified by referencing Figure 152. Clone DNA57838-1337 has been deposited with 
ATCC on June 23, 1998. It is understood that the deposited clone has the actual nucleic acid sequence and that 

20 the sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PR0791 polypeptide suggests that it has sequence 
similarity with MHC-I antigens, thereby indicating that PR0791 may be related to MHC-1 antigens. More 
specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced some sequenc identity 
between the PR0791 amino acid sequence and the following Dayhoff sequences, AF034346_1 , MMQ1K51 and 

25 HFEHUMAN. 

EXAMPLE 66 : Isolation of cDNA clones Encoding Human PRO 1004 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence, Incyte cluster sequence No. 73681 . This EST cluster sequence was then compared 

30 to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
to identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al., Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap w (Phil Green, Univ. of Washington, Seattle, 

35 Washington). The consensus sequence obtained therefrom is herein designated as DNA56516. 

In light of an observed sequence homology between the DNA56516 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. H43837, the Merck EST clone H43837 was purchased 
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and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 154. 

The full length clone shown in Figure 154 contained a single open reading frame with an apparent 
translation^ initiation site at nucleotide positions 119-121 and ending at the stop codon at nucleotide positions 
464-466 (Figure 154; SEQ ID NO:226). The predicted polypeptide precursor is 1 15 amino acids long (Figure 
5 155; SEQ ID NO:227). The full-length PRO1004 protein shown in Figure 155 has an estimated molecular 
weight of about 13,649 daltons and a pi of about 9.58. Analysis of the full-length PRO1004 sequence shown 
in Figure 155 (SEQ ID NO:227) evidences the presence of the following features: a signal peptide at about amino 
acids 1-24, a microbodies C-terminal targeting signal at about amino acids 1 13-1 15, a potential N-glycosylation 
site at about amino acids 71-74, and a domain having sequence identity with dihydrofolate reductase proteins at 
10 about amino acids 22-48. 

Analysis of the amino acid sequence of the full-length PRO1004 polypeptide using the Dayhoff database 
(version 35.45 SwissProt 35) evidenced homology between the PRO 1004 amino acid sequence and the following 
Dayhoff sequences: CELR02D3J7, LECIJvlOUSE, AF0066913, SSZ97390J, SSZ97395J, and 
SSZ97400J. 

1 5 Clone DN A57844- 14 10 was deposited with the ATCC on June 23 , 1 998, and is assigned ATCC deposit 

no. 203010. 

EXAMPLE 67 : Isolation of cDNA clones Encoding Human PROl 1 1 1 

An expressed sequence tag (EST) DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) 

20 was searched and an EST was identified which had homology to insulin-like growth factor binding protein. 

RNA for construction of cDNA libraries was isolated from human fetal brain. The cDNA libraries used 
to isolate the cDNA clones encoding human PROl 1 1 1 were constructed by standard methods using commercially 
available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT 
containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately 

25 by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or 
pRKD; pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al.. Science . 
253 : 1278-1280 (1991)) in the unique Xhol and NotI. 

The human fetal brain cDNA libraries (prepared as described above), were screened by hybridization 
with a synthetic oligonucleotide probe based upon the Incyte EST sequence described above: 

30 S'-CCACCACCTGGAGGTCCTGCAGTTGGGCAGGAACTCCATCCGGCAGATTG^' (SEQ ID NO:251). 

An identified cDNA clone was sequenced in entirety. The entire nucleotide sequence of PROl 1 1 1 is 
shown in Figure 156 (SEQ ID NO:228). Clone DNA58721-1475 contains a single open reading frame with an 
apparent translational initiation site at nucleotide positions 57-59 and a stop codon at nucleotide positions 2016- 
2018 (Figure 156; SEQ ID NO:228). The predicted polypeptide precursor is 653 amino acids long (Figure 157). 

35 The transmembrane domains are at positions 21-40 (type II) and 528-548. Clone DNA58721-1475 has been 
deposited with ATCC and is assigned ATCC deposit no. 203 1 10. The full-length PROl 1 1 1 protein shown in 
Figure 157 has an estimated molecular weight of about 72,717 daltons and a pi of about 6.99. 
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An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 157 (SEQ ID NO:229), revealed some sequence 
identity between the PROl 1 1 1 amino acid sequence and the following Dayhoff sequences: A58532, D86983 1 , 
RNPLGPV 1, PGS2_HUMAN, AF038127J, ALS_MOUSE, GPV_HUMAN, PGS2_B0VIN, ALS_PAPPA 
and 147020. 

5 

EXAMPLE 68: Isolation of cDNA clones Encoding Human PRO 1344 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA33790. Based on the DNA33790 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
10 the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PR01344. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5' -AGGTTCGTGATGGAGACAACCGCG-3 ' (SEQ ID N0:232) 
reverse PCR primer 5 '-TGTC AAGGACGC ACTGCCGTC ATG-3 ' (SEQ ID NO:233) 
15 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA33790 
sequence which had the following nucleotide sequence 
hybridization probe 

5'-TGGCCAGATCATCAAGCGTGTCTGTGGCAACGAGCGGCCAGCTCCTATCC-3' (SEQ ID NO:234) 
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

20 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO 1344 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal kidney tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR01344 (designated herein as DNA58723-1588 [Figure 158, SEQ ID NO:230]); and the derived protein 

25 sequence for PR01344. 

The entire nucleotide sequence of DNA58723-1588 is shown in Figure 158 (SEQ ID NO:230). Clone 
DNA58723-1588 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 26-28 and ending at the stop codon at nucleotide positions 2186-2188 (Figure 158). The predicted 
polypeptide precursor is 720 amino acids long (Figure 159). The full-length PRO 1344 protein shown in Figure 

30 159 has an estimated molecular weight of about 80,199 daltons and a pi of about 7.77. Analysis of the full- 
length PR01344 sequence shown in Figure 159 (SEQ ID N0:231) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 23, an EGF-like domain cysteine protein signature 
sequence from about amino acid 260 to about amino acid 271, potential N-glycosylation sites from about amino 
acid 96 to about amino acid 99, from about amino acid 279 to about amino acid 282, from about amino acid 3 16 

35 to about amino acid 319, from about amino acid 451 to about amino acid 454 and from about amino acid 614 
to about amino acid 617, an amino acid sequence block having homology to serine proteases, trypsin family from 
about amino acid 489 to about amino acid 505 and a CUB domain protein profile sequence from about amino 
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. acid 150 to about amino acid 166. Clone DNA58723-1588 has been deposited with ATCC on August 18, 1998 
and is assigned ATCC deposit no. 203133. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 159 (SEQ ID NO:231), evidenced significant 
homology between the PRO 1344 amino acid sequence and the following Dayhoff sequences: S77063_l, 
5 CRARMOUSE, P_R74775, PJ>90070, P_R09217, P_P70475, HSBMP16J and U50330J. 

EXAMPLE 69: Isolation of cDNA clones Encoding Human PROl 109 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA52642. The consensus DNA sequence 
1 0 was obtained by extending using repeated cycles of BLAST and phrap a previously obtained consensus sequence 
as far as possible using the sources of EST sequences discussed above. Based on the DNA52642 consensus 
sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence 
of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PROl 109. 

PCR primers (forward and reverse) were synthesized: 
15 forward PCR primer 5 ' -CCTT ACCTC AG AGGCC AG AGC A AGC-3 ' (SEQ ID NO:237) 
reverse PCR primer 5 ' -GAGCTTCATCCGTTCTGCGTTC ACC-3 ' (SEQ ID NO:238) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA52642 
sequence which had the following nucleotide sequence 
hybridization probe 

20 5 ' -CAGGAATGTAAAGCTTTAC AGAGGGTCGCC ATCCTCGTTCCCCACC-3 ' (SEQ ID NO:239) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PROl 109 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human SK-Lu-1 adenocarcinoma cell tissue (LIB247). 

25 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PROl 109 (designated herein as DNA58737-1473 [Figure 160, SEQ ID NO:235]) and the derived protein 
sequence for PRO 1109. 

The entire nucleotide sequence of DNA58737-1473 is shown in Figure 160 (SEQ ID NO:235). Clone 
DNA58737-1473 contains a single open reading frame with an apparent translational initiation site at nucleotide 

30 positions 119-120 and ending at the stop codon at nucleotide positions 1151-1 153 (Figure 160). The predicted 
polypeptide precursor is 344 amino acids long (Figure 161). The full-length PROl 109 protein shown in Figure 
161 has an estimated molecular weight of about 40,041 daltons and a pi of about 9.34. Analysis of the full- 
length PROl 109 sequence shown in Figure 161 (SEQ ID NO:236) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 27, potential N-glycosylation sites from about amino 

35 acid 4 to about amino acid 7, from about amino acid 220 to about amino acid 223 and from about amino acid 
335 to about amino acid 338 and an amino acid sequence block having homology to xylose isomerase proteins 
from about amino acid 191 to about amino acid 201. Clone DNA58737-1473 has been deposited with ATCC 
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on August 18, 1998 and is assigned ATCC deposit no. 203136. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 161 (SEQ ID NO: 236), evidenced significant 
homology between the PROl 109 amino acid sequence and the following Dayhoff sequences: HSUDPGALl, 
HSUDPB14J, NALSBOVIN, HSU10473J, CEW02B12J1, YNJ4CAEEL, AE000738J1, CET24D1J, 
5 S48121 and CEGLY91. 

EXAMPLE 70: Isolation of cDNA clones Encoding Human PRQ1383 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA53961. Based on the DNA53961 
10 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PR01383. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5 , -CATTTCCTTACCCTGGACCCAGCTCC-3' (SEQ ID N0:242) 
15 reverse PCR primer 5' -GAAAGGCCCACAGCACATCTGGCAG-3 ' (SEQ ID NO: 243) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA53961 
sequence which had the following nucleotide sequence 
hybridization probe 

S'-CCACGACCCGAGCAACTTCCTCAAGACCGACTTGTTTCTCTACAGC^' (SEQ ID N0:244) 
20 In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR01383 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal brain tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
25 PR01383 (designated herein as DNA58743-1609 [Figure 162, SEQ ID NO: 240]) and the derived protein 
sequence for PR01383. 

The entire nucleotide sequence of DNA58743-1609 is shown in Figure 162 (SEQ ID NO:240). Clone 
DNA58743-1609 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 122-124 and ending at the stop codon at nucleotide positions 1391-1393 (Figure 162). The predicted 

30 polypeptide precursor is 423 amino acids long (Figure 163). The full-length PR01383 protein shown in Figure 
163 has an estimated molecular weight of about 46,989 daltons and a pi of about 6.77. Analysis of the full- 
length PRO 1383 sequence shown in Figure 163 (SEQ ID NO: 241) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 24, a transmembrane domain from about amino acid 
339 to about amino acid 362, and potential N-glycosylation sites from about amino acid 34 to about amino acid 

35 37, from about amino acid 58 to about amino acid 61 , from about amino acid 142 to about amino acid 145, from 
about amino acid 197 to about amino acid 200, from about amino acid 300 to about amino acid 303 and from 
about amino acid 364 to about amino acid 367. Clone DNA58743-1609 has been deposited with ATCC on 
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August 25, 1998 and is assigned ATCC deposit no. 203154. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 163 (SEQ ID NO:241), evidenced significant 
homology between the PR01383 amino acid sequence and the following Dayhoff sequences: NMB_HUMAN, 
QNR_COTJA, PJV38335, P115_CHICK, P_W38164, A45993J, MMU70209J, D83704J andP_W39176. 

5 

EXAMPLE 71 : Isolation of cDNA Clones Encoding Human PRO1003 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence designated herein as 43055. This sequence was then compared to a variety of EST 
databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database 

10 (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search 
was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmology 
266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater 
that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the 
program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 

15 obtained therefrom is herein designated consenOl. 

In light of an observed sequence homology between the consensus sequence and an EST sequence 
encompassed within the Incyte EST clone no. 2849382, the Incyte EST clone 2849382 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 164. 

20 The entire nucleotide sequence of DNA58846-1409 is shown in Figure 164 (SEQ ID NO:245). Clone 

DNA58846-1409 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 41-43 and ending at the stop codon at nucleotide positions 293-295 (Figure 164). The predicted 
polypeptide precursor is 84 amino acids long (Figure 165). The full-length PRO1003 protein shown in Figure 
165 has an estimated molecular weight of about 9,408 daltons and a pi of about 9.28. Analysis of the full-length 

25 PRO 1003 sequence shown in Figure 165 (SEQ ID NO: 246) evidences the presence of a signal peptide at amino 
acids 1 to about 24, and a cAMP- and cGMP-dependent protein kinase phosphorylation site at about amino acids 
58 to about 61 . Analysis of the amino acid sequence of the full-length PRO 1003 polypeptide using the Dayhoff 
database (version 35.45 SwissProt 35) evidenced homology between the PRO 1003 amino acid sequence and the 
following Dayhoff sequences: AOPCZA363 3, SRTX_ATREN, A48298, MHVJHMSJ, VGL2CVMJH, 

30 DHDHTC22, CORT RAT, TAL6_HUMAN, P_W14123, and DVUFI 2. 

EXAMPLE 72: Isolation of cDNA Clones Encoding Human PRO1108 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA53237. 
35 In light of an observed sequence homology between the DNA53237 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 2379881, the Incyte EST clone 2379881 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
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The sequence of this cDNA insert is shown in Figure 166 and is herein designated DNA58848-1472. 

The entire nucleotide sequence of DNA58848-1472 is shown in Figure 166 (SEQ ID NO:247). Clone 
DNA58848-1472 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 77-79 and ending at the stop codon at nucleotide positions 1445-1447 (Figure 166). The predicted 
polypeptide precursor is 456 amino acids long (Figure 167). The full-length PROl 108 protein shown in Figure 
5 167 has an estimated molecular weight of about 52,071 daltons and a pi of about 9.46. Analysis of the full- 
length PROl 108 sequence shown in Figure 167 (SEQ ID NO:248) evidences the presence of the following: type 
II transmembrane domains from about amino acid 22 to about amino acid 42, from about amino acid 156 to 
about amino acid 176, from about amino acid 180 to about amino acid 199 and from about amino acid 369 to 
about amino acid 388, potential N-glycosylaion sites from about amino acid 247 to about amino acid 250, from 

10 about amino acid 327 to about amino acid 330, from about amino acid 328 to about amino acid 331 and from 
about amino acid 362 to about amino acid 365 and an amino acid block having homology to ER lumen protein 
retaining receptor protein from about amino acid 153 to about amino acid 190. Clone DNA58848-1472 has been 
deposited with ATCC on June 9, 1998 and is assigned ATCC deposit no. 209955. 

Analysis of the amino acid sequence of the full-length PROl 108 polypeptide suggests that it possesses 

15 significant sequence similarity to the LPAAT protein, thereby indicating that PROl 108 may be a novel LPAAT 
homolog. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced 
significant homology between the PROl 108 amino acid sequence and the following Dayhoff sequences, 
AF015811J, CER07E3_2, YL35CAEEL, S73863, CEF59F4 4, P_W06422, MMU41736J, MTV008J9, 
P_R99248 and Y67_BPT7. 

20 

EXAMPLE 73 : Isolation of cDNA Clones Encoding Human PROl 137 

The extracellular domain (ECD) sequences (including the secretion signal, if any) of from about 950 

known secreted proteins from the Swiss-Prot public protein database were used to search expressed sequence 

tag (EST) databases. The EST databases included public EST databases (e.g., GenBank) and a proprietary EST 
25 DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA). The search was performed using the 

computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmology 266:460-480 (1996)) as a 

comparison of the ECD protein sequences to a 6 frame translation of the EST sequence. Using this procedure, 

Incyte EST No. 3459449, also referred to herein as "DNA7108", was identified as an EST having a BLAST 

score of 70 or greater that did not encode a known protein. 
30 A consensus DNA sequence was assembled relative to the DNA7108 sequence and other ESTs using 

repeated cycles of BLAST and the program "phrap" (Phil Green, Univ. of Washington, Seattle, WA). The 

consensus sequence obtained therefrom is referred to herein as DNA53952. 

In light of an observed sequence homology between the DNA53952 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3663102, the Incyte EST clone 3663102 was purchased 
35 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

The sequence of this cDNA insert is shown in Figure 168. 



436 



WO 99/63088 



PCT/US99/12252 



The entire nucleotide sequence of DNA58849-1494 is shown in Figure 168 (SEQ ID NO:249). Clone 
DNA58849-1494 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 77-79 and ending at the stop codon at nucleotide positions 797-799 (Figure 168). The predicted 
polypeptide precursor is 240 amino acids long (Figure 169). The full-length PRO 1 137 protein shown in Figure 
169 has an estimated molecular weight of about 26,064 daltons and a pi of about 8.65. Analysis of the full- 
5 length PROl 137 sequence shown in Figure 169 (SEQ ID NO:250) evidences the presence of a signal peptide 
at about amino acids 1 to 14 and a potential N-glycosylation site at about amino acids 101-105. 

Analysis of the amino acid sequence of the full-length PROl 137 polypeptide suggests that it possesses 
significant sequence similarity to ribosyltransferase thereby indicating that PROl 137 may be a novel member 
of the ribosyltransferase family of proteins. Analysis of the amino acid sequence of the full-length PROl 137 
10 polypeptide using the Dayhoff database (version 35.45 SwissProt 35) evidenced homology between the PRO 1 137 
amino acid sequence and the following Dayhoff sequences: MMART51, NARGJvlOUSE, GEN 11909, 
GEN13794, GEN14406, MMRNART62J, and P_R41876. 

EXAMPLE 74 : Isolation of cDNA clones Encoding Human PROl 138 

15 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

Incyte EST sequence, Incyte cluster sequence no. 165212. This cluster sequence was then compared to a variety 
of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a 
proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 

20 et al.. Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated as DNA54224. The assembly included a proprietary 
Genentech EST designated herein as DNA49140 (Figure 172; SEQ ID NO:254). 

25 In light of an observed sequence homology between the DNA54224 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3836613, the Incyte EST clone 3836613 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 170 and is the full-length DNA sequence for PROl 138. 
Clone DNA58850-1495 was deposited with the ATCC on June 9, 1998, and is assigned ATCC deposit no. 

30 209956. 

The entire nucleotide sequence of DNA58850-1495 is shown in Figure 170 (SEQ ID NO:252). Clone 
DNA58850-1495 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 38-40 and ending at the stop codon at nucleotide positions 1043-1045 (Figure 170). The predicted 
polypeptide precursor is 335 amino acids long (Figure 171). The full-length PROl 138 protein shown in Figure 
35 171 has an estimated molecular weight of about 37,421 Daltons and a pi of about 6.36. Analysis of the full- 
length PROl 138 sequence shown in Figure 171 (SEQ ID NO:253) evidences the presence of the following 
features: a signal peptide at about amino acid 1 to about amino acid 22; a transmembrane domain at about amino 

437 



WO 99/63088 



PCT/US99/12252 



acids 224 to about 250; a leucine zipper pattern at about amino acids 229 to about 250; and potential N- 
glycosylation sites at about amino acids 98-101, 142-145, 148-151, 172-175, 176-179, 204-207, and 291-295. 

Analysis of the amino acid sequence of the lull-length PRO 1 138 polypeptide suggests that it possesses 
significant sequence similarity to the CD84, thereby indicating that PROl 138 may be a novel member of the Ig 
superfamily of polypeptides. More particularly, analysis of the amino acid sequence of the full-length PROl 138 
5 polypeptide using the Dayhoff database (version 35.45 SwissProt 35) evidenced homology between the PROl 138 
amino acid sequence and the following Dayhoff sequences: HSU82988_1 , HUMLY9J , P_R9763 1 , P_R97628, 
P_R97629, P_R97630, CD48_RAT, CD2_HUMAN, P_P93996, and HUMBGPJ. 

Clone DNA58850-1495 was deposited with ATCC on June 9, 1998, and is assigned ATCC deposit no. 

209956. 

10 

EXAMPLE 75 : Isolation of cDNA clones Encoding Human PRO 1054 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 66212. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 

15 and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al. , Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 

20 consensus sequence obtained therefrom is herein designated DNA55722. 

In light of an observed sequence homology between the DNA55722 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 319751, the Incyte EST clone 319751 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 173 and is herein designated as DNA58853-1423. 

25 Clone DNA58853-1423 contains a single open reading frame with an apparent translations initiation 

site at nucleotide positions 46-48 and ending at the stop codon at nucleotide positions 586-588 (Figure 173). The 
predicted polypeptide precursor is 180 amino acids long (Figure 174). The full-length PRO1054 protein shown 
in Figure 174 has an estimated molecular weight of about 20,638 daltons and a pi of about 5.0. Analysis of the 
full-length PRO 1054 sequence shown in Figure 174 (SEQ ID NO: 256) evidences the presence of the following: 

30 a signal peptide from about amino acid 1 to about amino acid 1 8, a leucine zipper pattern from about amino acid 
155 to about amino acid 176 and amino acid sequence blocks having homology to lipocalin proteins from about 
amino acid 27 to about amino acid 38 and from about amino acid 110 to about amino acid 120. Clone 
DNA58853-1423 has been deposited with ATCC on June 23, 1998 and is assigned ATCC deposit no. 203016. 
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

35 alignment analysis of the full-length sequence shown in Figure 174 (SEQ ID NO:256), evidenced significant 
homology between the PRO1054 amino acid sequence and the following Dayhoff sequences: MUPl_MOUSE, 
MUP6J40USE, MUP2_MOUSE, MUP8_MOUSE, MUP5_MOUSE, MUP4 MOUSE, S10124, 
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MUPM_MOUSE, MUPRAT and ECU70823J. 

EXAMPLE 76 : Isolation of cDNA clones Encoding Human PRQ994 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 157555. This EST cluster sequence was then compared 
5 to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) 
and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al. , Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 

10 DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA55728. 

In light of an observed sequence homology between the DNA55728 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2860366, the Incyte EST clone 2860366 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

15 The sequence of this cDNA insert is shown in Figure 175 and is herein designated as DNA5 8855- 1422. 

Clone DNA58855-1422 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 31-33 and ending at the stop codon at nucleotide positions 718-720 (Figure 175). The 
predicted polypeptide precursor is 229 amino acids long (Figure 176). The full-length PR0994 protein shown 
in Figure 176 has an estimated molecular weight of about 25,109 daltons and a pi of about 6.83. Analysis of 

20 the full-length PR0994 sequence shown in Figure 176 (SEQ ID NO: 258) evidences the presence of the 
following: transmembrane domains from about amino acid 10 to about amino acid 31, from about amino acid 
50 to about amino acid 72, from about amino acid 87 to about amino acid 110 and from about amino acid 191 
to about amino acid 213, potential N-glycosylation sites from about amino acid 80 to about amino acid 83, from 
about amino acid 132 to about amino acid 135, from about amino acid 148 to about amino acid 151 and from 

25 about amino acid 163 to about amino acid 166 and an amino acid block having homology to TNFR/NGFR 
cysteine-rich region proteins from about amino acid 4 to about amino acid 1 1 . Clone DNA58855-1422 has been 
deposited with ATCC on June 23, 1998 and is assigned ATCC deposit no. 203018. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 176 (SEQ ID NO:258), evidenced significant 

30 homology between the PR0994 amino acid sequence and the following Dayhoff sequences: AF027204 1, 
TAL6_HUMAN, ILT4HUMAN, JC6205, MMU57570J, S40363, ETU56093J, S42858, P_R66849 and 
P_R74751. 

EXAMPLE 77 : Isolation of cDNA clones Encoding Human PRQ812 
35 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the Incyte database, designated 170079. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
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and a proprietary EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al. , Methods in Enzvmology 266:460480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
5 consensus sequence obtained therefrom is herein designated as DNA55721 . 

In light of an observed sequence homology between the DNA55721 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 388964, the Incyte EST clone 388964 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 177 and is herein designated as DNA59205-1421. 

10 Clone DNA59205-1421 contains a single open reading frame with an apparent translational initiation 

site at nucleotide positions 55-57 and ending at the stop codon at nucleotide positions 304-306 (Figure 177). The 
predicted polypeptide precursor is 83 amino acids long (Figure 178). The full-length PR0812 protein shown 
in Figure 178 has an estimated molecular weight of about 9,201 daltons and a pi of about 9,3. Analysis of the 
full-length PR0812 sequence shown in Figure 178 (SEQ ID NO:260) evidences the presence of the following: 

15 a signal peptide from about amino acid 1 to about amino acid 15, a cAMP- and cGMP-dependent protein kinase 
phosphorylation site from about amino acid 73 to about amino acid 76 and protein kinase C phosphorylation sites 
from about amino acid 70 to about amino acid 72 and from about amino acid 76 to about amino acid 78. Clone 
DNA59205-1421 has been deposited with ATCC on June 23, 1998 and is assigned ATCC deposit no. 203009. 
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

20 alignment analysis of the full-length sequence shown in Figure 178 (SEQ ID NO:260), evidenced significant 
homology between the PR0812 amino acid sequence and the following Dayhoff sequences: P_W35802, 
PW35803, PSC1JRAT, S68231, GEN13917, PSC2_RAT, CC 10_HUM A N,UTER_RABIT,AF008595 J and 
A56413. 

25 EXAMPLE 78 : Isolation of cDNA clones Encoding Human PRO1069 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST sequence designated herein as 100727. This sequence was then compared to a proprietary EST 
DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 

30 Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "paraph (Phil Green, Univ. of Washington, Seattle, Washington). The consensus sequence obtained 
therefrom is herein designated DNA56001. 

In light of an observed sequence homology between the DNA56001 consensus sequence and an EST 

35 sequence encompassed within the Incyte EST clone no. 3533881 , the Incyte EST clone 3533881 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 179 and is the full-length DNA sequence for PRO1069. 
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Clone DNA5921 1-1450 was deposited with the ATCC on June 9, 1998, and is assigned ATCC deposit no. 
209960. 

The entire nucleotide sequence of DNA5921 1-1450 is shown in Figure 179 (SEQ ID NO:261). Clone 
DNA5921 1-1450 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 197-199 and ending at the stop codon at nucleotide positions 464-466. The predicted polypeptide 
5 precursor is 89 amino acids long (Figure 180). The full-length PRO1069 protein shown in Figure 180 has an 
estimated molecular weight of about 9,433 daltons and a pi of about 8.21 . Analysis of the full-length PRO 1069 
sequence shown in Figure 180 (SEQ ID NO:262) evidences the presence of the following features: a signal 
peptide sequence at amino acid 1 to about 16; a transmembrane domain at about amino acids 36 to about 59; 
potential N-myristoylation sites at about amino acids 41-46, 45-50, and 84-89; and homology with extracellular 
10 proteins SCP/Tpx-l/Ag5/PR-l/Sc7 at about amino acids 54 to about 66. 

Analysis of the amino acid sequence of the full-length PRO 1069 polypeptide suggests that it possesses 
significant sequence similarity to CHIF, thereby indicating that PRO1069 may be a member of the CHIF family 
of polypeptides. More particularly, analysis of the amino acid sequence of the full-length PRO1069 polypeptide 
using the Dayhoff database (version 35.45 SwissProt 35) evidenced homology between the PRO1069 amino acid 
15 sequence and the following Dayhoff sequences: CHIF RAT, A55571, PLM HUMAN, A40533, 
ATNGBOVIN, RICJvlOUSE, PETDJ5YNY3, VTB1XENLA, A05009, and S75086. 

Clone DNA5921 1-1450 was deposited with the ATCC on June 9, 1998, and is assigned ATCC deposit 
no. 209960. 

20 EXAMPLE 79 : Isolation of cDNA Clones Encoding Human PRO 1 129 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence designated herein as 98833. The Incyte EST cluster sequence no. 98833 sequence 
was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) 

25 to identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al., Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56038. 

30 In light of an observed sequence homology between the DNA56038 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 1335241, the Incyte EST clone 1335241 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 181 and is herein designated DNA59213-1487. 

The full length clone shown in Figure 181 contained a single open reading frame with an apparent 

35 translational initiation site at nucleotide positions 42-44 and ending at the stop codon found at nucleotide positions 
1614-1616 (Figure 181; SEQ ID NO:263). The predicted polypeptide precursor is 524 amino acids long, has 
a calculated molecular weight of approximately 60,310 daltons and an estimated pi of approximately 7.46. 
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Analysis of the full-length PROl 129 sequence shown in Figure 182 (SEQ ID NO:264) evidences the presence 
of the following: type II transmembrane domains from about amino acid 13 to about amino acid 32 and from 
about amino acid 77 to about amino acid 102, a cytochrome P-450 cysteine heme-iron ligand signature sequence 
from about amino acid 461 to about amino acid 470 and potential N-glycosylation sites from about amino acid 
1 12 to about amino acid 1 15 and from about amino acid 168 to about amino acid 171 . Clone DNA59213- 1487 
5 has been deposited with the ATCC on June 9, 1998 and is assigned ATCC deposit no. 209959. 

Analysis of the amino acid sequence of the full-length PROl 129 polypeptide suggests that it possesses 
sequence similarity to the cytochrome P-450 family of proteins. More specifically, an analysis of the Dayhoff 
database (version 35.45 SwissProt 35) evidenced some degree of homology between the PROl 129 amino acid 
sequence and the following Dayhoff sequences, AC004523 1, S45702, AF054821J and 153015. 

10 

EXAMPLE 80 : Isolation of cDNA clones Encoding Human PRO 1068 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the LIFESEQ® database, designated Incyte cluster no. 14 1 736. This EST cluster sequence 
was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 

15 (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to 
identify existing homologies. One or more of the ESTs was derived from a human mast cell line from a patient 
with mast cell leukemia. The homology search was performed using the computer program BLAST or BLAST2 
(Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score 
of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a 

20 consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, 
Washington). The consensus sequence obtained therefrom is herein designated DNA56094. 

In light of an observed sequence homology between the DNA56094 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 004974, the Incyte EST clone 004974 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

25 The sequence of this cDNA insert is shown in Figure 183 and is herein designated as DNA59214-1449 (SEQ 
ID NO:265). 

The full length clone shown in Figure 183 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 42-44 and ending at the stop codon found at nucleotide positions 
414^16 (Figure 183; SEQ ID NO:265). The predicted polypeptide precursor (Figure 184, SEQ ID NO:266) 

30 is 124 amino acids long. PRO1068 has a calculated molecular weight of approximately 14,284 daltons and an 
estimated pi of approximately 8.14. The PRO1068 polypeptide has the following additional features, as 
indicated in Figure 184: a signal peptide sequence at about amino acids 1-20, a urotensin II signature sequence 
at about amino acids 118-123, a cell auachment sequence at about amino acids 64-66, and a potential cAMP- 
and cGMP-dependent protein kinase phosphorylation site at about amino acids 112-1 15. 

35 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 184 (SEQ ID NO:266), revealed homology 
between the PRO 1068 amino acid sequence and the following Dayhoff sequences: HALBOP1, MTV043 36, 
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150498, and P_R78445 

Clone DNA59214-1449 was deposited with the ATCC on July 1, 1998 and is assigned ATCC deposit 
no.203O46. 

EXAMPLE 81: Isolation of cDNA clones Encoding Human PRO 1066 
5 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

lncyte EST cluster sequence designated herein as 79066. The Incyte EST cluster sequence no. 79066 sequence 
was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) 
to identify existing homologies. The homology search was performed using the computer program BLAST or 

10 BLAST2 (Aitshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56121. 

In light of an observed sequence homology between the DNA56121 consensus sequence and an EST 

15 sequence encompassed within the Incyte EST clone no. 1515315, the Incyte EST clone 1515315 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 185 and is herein designated DNA59215-1425. 

The full length clone shown in Figure 185 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 176-178 and ending at the stop codon found at nucleotide 

20 positions 527-529 (Figure 185; SEQ ID NO:267). The predicted polypeptide precursor is 1 17 amino acids long, 
has a calculated molecular weight of approximately 12,91 1 daltons and an estimated pi of approximately 5.46. 
Analysis of the full-length PRO1066 sequence shown in Figure 186 (SEQ ID NO:268) evidences the presence 
of the following: a signal peptide from about amino acid 1 to about amino acid 23, a cAMP- and cGMP- 
dependent protein kinase phosphorylation site from about amino acid 38 to about amino acid 41 and potential 

25 N-myristoylation sites from about amino acid 5 to about amino acid 10, from about amino acid 63 to about amino 
acid 68 and from about amino acid 83 to about amino acid 88. Clone UNQ524 (DNA59215-1425) has been 
deposited with the ATCC on June 9, 1998 and is assigned ATCC deposit no. 209961. 

Analysis of the amino acid sequence of the full-length PRO 1066 polypeptide suggests that it does not 
possess significant sequence similarity to any known human protein. However, an analysis of the Dayhoff 

30 database (version 35.45 SwissProt 35) evidenced some degree of homology between the PRO1066 amino acid 
sequence and the following Dayhoff sequences, MOTI HUMAN, AF025667 1, MTCY19H9J and 
RABIGKCHJ. 

EXAMPLE 82 : Isolation of cDNA Clones Enc oding Human PROl 184 
35 Use of the signal sequence algorithm described in Example 3 on ESTs from an Incyte database allowed 

identification a candidate sequence designated herein as DNA56375. This sequence was then compared to a 
variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and 
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a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al. , Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program tt phrap w (Phil Green, University of Washington, Seattle, Washington). The 
5 consensus sequence obtained therefrom is herein designated DNA56375. 

In light of an observed sequence homology between the DNA56375 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 1428374, the Incyte EST clone 1428374 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a mil-length protein. 
The sequence of this cDNA insert is shown in Figure 187. 

10 The full length clone shown in Figure 187 contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 106-108 and ending at the stop codon found at nucleotide 
positions 532-534 (Figure 187; SEQ ID NO:269). The predicted polypeptide precursor is 142 amino acids long, 
has a calculated molecular weight of approximately 15,690 daltons and an estimated pi of approximately 9.64. 
Analysis of the full-length PROl 184 sequence shown in Figure 188 (SEQ ID NO:270) evidences the presence 

15 of a signal peptide at about amino acids 1-38. Clone DNA59220-1514 has been deposited with the ATCC on 
June 9, 1998. It is understood that the deposited clone has the actual sequences and that representations are 
presented herein. 

Analysis of the amino acid sequence of the full-length PROl 184 polypeptide suggests mat it possesses 
some sequence identity with a protein called TIM from Drosophila virilis, designated "DVTIMS02 _P in the 
20 Dayhoff data base, (version 35.45 SwissProt 35). Other 

Dayhoff database (version 35.45 SwissProt 35) sequences having some degree of sequence identity with 
PROl 184 include: WIS1 SCHPO, FG02186J, ATAC00239124 and MSAIPRP 1. 

EXAMPLE 83 : Isolation of cDNA clones Encoding Human PRO 1360 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
sequence from an Incyte database, designated DNA 10572. This EST sequence was then compared to a variety 
of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank, Merck/Wash. 
U.) and a proprietary EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to identify 
existing homologies. The homology search was performed using the computer program BLAST or BLAST2 
(Altshul et al., Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score 
of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a 
consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, 
Washington). The consensus sequence obtained therefrom is herein designated DNA57314. 

In light of an observed sequence homology between the DNA57314 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. AA406443, the Merck EST clone AA406443 was 
purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length 
protein. The sequence of this cDNA insert is shown in Figure 189 and is herein designated as DNA59488-1603. 
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The full length clone shown in Figure 189 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 54-56 and ending at the stop codon found at nucleotide positions 
909-91 1 (Figure 189; SEQ ID NO:271). The predicted polypeptide precursor (Figure 190, SEQ ID NO:272) 
is 285 amino acids long. PRO1360 has a calculated molecular weight of approximately 31,433 daltons and an 
estimated pi of approximately 7.32. Clone DNA59488-1603 was deposited with the ATCC on August 25, 1998 
5 and is assigned ATCC deposit no. 203157. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 190 (SEQ ID NO: 272), revealed sequence identity 
between the PRO1360 amino acid sequence and the following Dayhoff sequences: UN51CAEEL, 
YD4BJSCHPO, AF000634J, GFO_ZYMMO, YE1J_SCHP0, D86566J, ZMGFOJ, S76976, 
10 PPSAJSYNY3, and CEF28B1_4. 

EXAMPLE 84 : Isolation of cDNA clones Encoding Human PRO 1029 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 18763. This EST cluster sequence was then compared 

15 to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et ah, Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 

20 DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA57854. 

In light of an observed sequence homology between the DNA57854 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. T98880, the Merck EST clone T98880 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

25 The sequence of this cDNA insert is shown in Figure 191 and is herein designated as DNA59493-1420. 

Clone DNA59493-1420 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 39-41 and ending at the stop codon at nucleotide positions 297-299 (Figure 191). The 
predicted polypeptide precursor is 86 amino acids long (Figure 192). The full-length PRO 1029 protein shown 
in Figure 192 has an estimated molecular weight of about 9,548 daltons and a pi of about 8.52. Analysis of the 

30 full-length PRO1029 sequence shown in Figure 192 (SEQ ID NO:274) evidences the presence of the following: 
a signal peptide from about amino acid 1 to about amino acid 19, an amino acid block having homology to 
bacterial rhodopsins retinal binding site protein from about amino acid 50 to about amino acid 61 , a prenyl group 
binding site from about amino acid 83 to about amino acid 86 and a potential N-glycosylation site from about 
amino acid 45 to about amino acid 48. Clone DNA59493-1420 has been deposited with ATCC on July 1, 1998 

35 and is assigned ATCC deposit no. 203050, 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 192 (SEQ ID NO:274), evidenced significant 
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homology between the PRO1029 amino acid sequence and the following Dayhoff sequences: S66088, 
AF031815J, MM4A6LJ, PSEIS52a-l, S17699 and P_R63635. 

EXAMPLE 85 : Isolation of cDNA clones Encoding Human PRQ1139 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
5 cluster sequence from the Incyte database, designated 4461. This EST cluster sequence was then compared to 
a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and 
a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al., Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 

10 in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA57312. 

The DNA57312 consensus sequence included a 172 nucleotides long public EST (T62095, 
Merck/University of Washington public database). This EST clone, identified herein as a putative protein coding 

15 sequence, was purchased from Merck, and sequenced to provide the coding sequence of PROl 139 (Figure 193). 
As noted before, the deduced amino acid sequence of DN A59497- 1496 shows a significant sequence identity with 
the deduced amino acid sequence of HSOBRGRP_l. The full-length protein (Figure 194) contains a putative 
signal peptide between amino acid residues 1 and about 28, and three putative transmembrane domains 
(approximate amino acid residues 33-52, 71-89, 98-120). 

20 

EXAMPLE 86 : Isolation of cDNA clones Encoding Human PRQ1309 

An expressed sequence tag (EST) DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) 
was searched and an EST was identified which showed homology to SLIT. 

RNA for construction of cDNA libraries was isolated from human fetall brain tissue. The cDNA 
25 libraries used to isolate the cDN A clones encoding human PRO 1 309 were constructed by standard methods using 
commercially available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with 
oligo dT containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized 
appropriately by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as 
pRKB or pRKD; pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al., Science . 
30 253: 1278-1280 (1991)) in the unique Xhol and NotI. 

The cDNA libraries (prepared as described above), were screened by hybridization with a synthetic 
oligonucleotide probe derived from the above described Incyte EST sequence: 

S'-TCCGTCCAGGGGGACGCCTTTCAGAAACTGCGCCGAGTrAAGGAAC^ (SEQ ID NO:279). 

A cDNA clone was isolated and sequenced in entirety. The entire nucleotide sequence of DNA59588- 
35 1571 is shown in Figure 195 (SEQ ID NO:277). Clone DNA59588-1571 contains a single open reading frame 
with an apparent translation^ initiation site at nucleotide positions 720-722 and a stop codon at nucleotide 
positions 2286-2288 (Figure 195; SEQ ID NO:277). The predicted polypeptide precursor is 522 amino acids 
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long. The signal peptide is approximately at 1-34 and the transmembrane domain is at approximately 428-450 
of SEQ ID NO:278. Clone DNA59588-1571 has been deposited with ATCC and is assigned ATCC deposit no. 
203106. The full-length PRO1309 protein shown in Figure 196 has an estimated molecular weight of about 
58,614 daltons and a pi of about 7.42. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
5 alignment analysis of the full-length sequence shown in Figure 196 (SEQ ID NO:278), revealed sequence identity 
between the PRO1309 amino acid sequence and the following Dayhoff sequences: AB007876_1 , GPVMOUSE, 
ALS_RAT, P_R85889, LUM_CHICK, AB014462J, PGS1_CANFA, CEM88J7, A58532 and GEN11209. 

EXAMPLE 87 : Isolation of cDNA Clones Encoding Human PRO1028 

10 Use of the signal sequence algorithm described in Example 3 above allowed identification of a certain 

EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identity existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 

15 Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA59603. 

In light of an observed sequence homology between the DNA59603 sequence and an EST sequence 

20 contained within Incyte EST clone no. 1497725, the Incyte EST clone no. 1497725 was purchased and the cDNA 
insert was obtained and sequenced. It was found that the insert encoded a full-length protein. The sequence of 
this cDNA insert is shown in Figure 197 and is herein designated as DNA59603-1419. 

The entire nucleotide sequence of DNA59603-1419 is shown in Figure 197 (SEQ ID NO:280). Clone 
DNA59603-1419 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 

25 positions 21-23 and ending at the stop codon at nucleotide positions 612-614 (Figure 197). The predicted 
polypeptide precursor is 197 amino acids long (Figure 198). The full-length PRO 1028 protein shown in Figure 
. 198 has an estimated molecular weight of about 20,832 daltons and a pi of about 8.74. Clone DNA59603-1419 
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 
the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

30 Analyzing the amino acid sequence of SEQ ID NO:281, the putative signal peptide is at about amino 

acids 1-19 of SEQ ID NO:281. An N-glycosylation site is at about amino acids 35-38 of SEQ ID NO:281. A 
C-rype lectin domain is at about amino acids 108-1 17 of SEQ ID NO:281 , indicating that PR0513 may be related 
to or be a lectin. The corresponding nucleotides of these amino acid sequences or others can be routinely 
determined given the sequences provided herein. 

35 
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EXAMPLE 88: Isolation of cDNA Clones Encoding Human PRO 1027 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a certain 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
5 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56399. 

10 In light of an observed sequence homology between the DNA56399 sequence and an EST sequence 

contained within Incyte EST clone no. 937605, the Incyte EST clone no. 937605 was purchased and the cDNA 
insert was obtained and sequenced. It was found that the insert encoded a full-length protein. The sequence of 
this cDNA insert is shown in Figure 199 and is herein designated as DNA59605-1418. 

The entire nucleotide sequence of DNA59605-1418 is shown in Figure 199 (SEQ ID NO:282). Clone 

15 DNA59605-1418 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 31-33 and ending at the stop codon at nucleotide positions 262-264 (Figure 199). The predicted 
polypeptide precursor is 77 amino acids long (Figure 200). The full-length PRO1027 protein shown in Figure 
200 has an estimated molecular weight of about 8,772 daltons and a pi of about 9.62. Clone DNA59605-1418 
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 

20 the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Analyzing the amino acid sequence of SEQ ID NO:283, the putative signal peptide is at about amino 
acids 1-33 of SEQ ID NO:283. The type II fibronectin collagen-binding domain begins at about amino acid 30 
of SEQ ID NO:283. The corresponding nucleotides for these amino acid sequences and others can be routinely 
determined given the sequences provided herein. PRO1027 may be involved in tissue formation or repair. 

25 The following Dayhoff designations appear to have some sequence identity with PRO 1027: 

SFT2_YEAST; ATM3E9_2; A69826; YM16_MARPO; E64896; U60193_2; MTLRAJ205J; MCU60315J70; , 
SPAS_SHIFL; and S54213. 

EXAMPLE 89 : Isolation of cDNA Clones Encoding Human PRO1107 

30 Use of the signal sequence algorithm described in Example 3 above allowed identification of a certain 

EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al. , Methods in 

35 Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
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obtained therefrom is herein designated DNA56402. 

In light of an observed sequence homology between the DNA56402 sequence and an EST sequence 
contained within Incyte EST clone no. 3203694, the Incyte EST clone no. 3203694 was purchased and the cDNA 
insert was obtained and sequenced. It was found that the insert encoded a full-length protein. The sequence of 
this cDNA insert is shown in Figure 201 and is herein designated as DNA59606-1471. 
5 The entire nucleotide sequence of DNA59606-1471 is shown in Figure 201 (SEQ ID NO:284). Clone 

DNA59606-1471 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 244-246 and ending at the stop codon at nucleotide positions 1675-1677 of SEQ ID NO:284 (Figure 
201). The predicted polypeptide precursor is 477 amino acids long (Figure 202). The full-length PROl 107 
protein shown in Figure 202 has an estimated molecular weight of about 54,668 daltons and a pi of about 6.33. 
10 Clone DNA59606-1471 has been deposited with ATCC on June 9, 1998. It is understood that the deposited 
clone has the actual nucleic acid sequence and that the sequences provided herein are based on known 
sequencing techniques. 

Analysis of the amino acid sequence of the full-length PROl 107 polypeptide suggests that it possesses 
significant sequence similarity to phosphodiesterase I/nucleotide phyrophosphatase, human insulin receptor 
15 tyrosine kinase inhibitor, alkaline phosphodiesterase and autotaxin, thereby indicating that PROl 107 may have 
at least one or all of the activities of these proteins, and that PROl 107 is a novel phosphodiesterase. More 
specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced sequence identity 
between the PROl 107 amino acid sequence and at least the following Dayhoff sequences: AF005632 1, 
P_R79148, RNU78787J, AF060218_4, A57080 and HUMATXTl. 

20 

EXAMPLE 90 : Isolation of cDNA clones Encoding Human PROl 140 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST sequence, Incyte cluster sequence No. 135917. This sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 

25 EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et ah, Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, Univ. of Washington, Seattle, Washington). The consensus sequence obtained 

30 therefrom is herein designated DNA56416. 

In light of an observed sequence homology between DNA56416 and an EST sequence contained within 
Incyte EST clone no. 3345705, Incyte EST clone no. 3345705 was obtained and its insert sequenced. It was 
found that the insert encoded a full-length protein The sequence, designated herein as DNA59607-1497, which 
is shown in Figure 203, is the full-length DNA sequence for PROl 140. Clone DNA59607-1497 was deposited 

35 with the ATCC on June 9, 1998, and is assigned ATCC deposit no. 209946. 

The entire nucleotide sequence of DNA59607-1497 is shown in Figure 203 (SEQ ID NO:286). Clone 
DNA59607-1497 contains a single open reading frame with an apparent translational initiation site at nucleotide 
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positions 210-212 and ending at the stop codon at nucleotide positions 975-977 (Figure 203). The predicted 
polypeptide precursor is 255 amino acids long (Figure 204). The full-length PROl 140 protein shown in Figure 
204 has an estimated molecular weight of about 29,405 daltons and a pi of about 7.64. Analysis of the full- 
length PROl 140 sequence shown in Figure 204 (SEQ ID NO:287) evidences the presence of three 
transmembrane domains at about amino acids 101 to 118, 141 to 161 and 172 to 191. 
5 Analysis of the amino acid sequence of the full-length PROl 140 polypeptide using the Dayhoff database 

(version 35.45 SwissProt 35) evidenced homology between the PROl 140 amino acid sequence and the following 
Dayhoff sequences: AF023602J, AF000368J, CIN3_RAT, AF003373J, GEN13279, and AF003372J. 

Clone DNA59607-1497 was deposited with the ATCC on June 9, 1998, and is assigned ATCC deposit 
no. 209946. 

10 

EXAMPLE 91 : Isolation of cDNA clones Encoding Human PROl 106 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST sequence. This sequence was then compared to a variety of expressed sequence tag (EST) databases 
which included public EST databases (e.g. , GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte 

15 Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using 
the computer program BLAST or BLAST2(Altshuletal., Methods in Enzvmology 266:460-480 (1996)). Those 
comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known 
proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, 
Univ. of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated 

20 DNA56423. 

In light of an observed sequence homology between DNA56423 and an EST sequence contained within 
Incyte EST clone no. 1711247, Incyte EST clone no. 1711247 was obtained and its insert sequenced. It was 
found that the insert encoded a full-length protein The sequence, designated herein as DNA59609-1470, which 
is shown in Figure 205, is the full-length DNA sequence for PROl 106. Clone DNA59609-1470 was deposited 
25 with the ATCC on June 9, 1998, and is assigned ATCC deposit no. 209963. 

The entire nucleotide sequence of DNA59609-1470 is shown in Figure 205 (SEQ ID NO:288). Clone 
DNA59609-1470 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 61-63 and ending at the stop codon at nucleotide positions 1468-1470 of SEQ ID NO:288 (Figure 205). 
The predicted polypeptide precursor is 469 amino acids long (Figure 206). The full-length PRO1106 protein 
30 shown in Figure 206 has an estimated molecular weight of about 52,689 daltons and a pi of about 8.68. It is 
understood that the skilled artisan can construct the polypeptide or nucleic acid encoding therefor to exclude any 
one or more of all of these domains. For example, the transmembrane domain region(s) and/or either of the 
amino terminal or carboxyl end can be excluded. Clone DNA59609-1470 has been deposited with ATCC on 
June 9, 1998. It is understood that the deposited clone has the actual nucleic acid sequence and that the 
35 sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PRO 1 106 polypeptide suggests that it possesses 
significant sequence similarity to the peroxisomal ca-dependent solute carrier, thereby indicating that PROl 106 
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may be a novel transporter. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 
35) evidenced sequence identity between the PRO 1106 amino acid sequence and at least the following Dayhoff 
sequences, AF004161J, IG002N01_25, GDC_BOVIN and BT1_MAIZE. 

EXAMPLE 92 : Isolation of cDNA clones Encoding Human PRO 1291 
5 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the Incyte database, designated 120480. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 

10 et al., Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA56425. 

In light of an observed sequence homology between the DNA56425 sequence and an EST sequence 

15 encompassed within the Incyte EST clone no. 2798803, the Incyte EST clone 2798803 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 207 and is herein designated as DNA59610-1556. 

Clone DNA59610-1556 contains a single open reading frame with an apparent translation^ initiation 
site at nucleotide positions 61-63 and ending at the stop codon at nucleotide positions 907-909 (Figure 207). The 

20 predicted polypeptide precursor is 282 amino acids long (Figure 208). The full-length PR01291 protein shown 
in Figure 208 has an estimated molecular weight of about 30,878 daltons and a pi of about 5.27. Analysis of 
the full-length PR01291 sequence shown in Figure 208 (SEQ ID NO:291) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 28, a transmembrane domain from about 
amino acid 258 to about amino acid 281 and potential N-glycosylation sites from about amino acid 1 12 to about 

25 amino acid 1 15, from about amino acid 160 to about amino acid 163 , from about amino acid 190 to about amino 
acid 193, from about amino acid 196 to about amino acid 199, from about amino acid 205 to about amino acid 
208, from about amino acid 216 to about amino acid 219 and from about amino acid 220 to about amino acid 
223. . Clone DNA596 10-1556 has been deposited with ATCC on June 16, 1998 and is assigned ATCC deposit 
no. 209990. 

30 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 208 (SEQ ID NO:291), evidenced significant 
homology between the PR01291 amino acid sequence and the following Dayhoff sequences: HSU90552_1, 
HSU90144J, AF033107J, HSB73J, HSU90142J, GGCD80J, P_W34452, MOGJVIOUSE, B39371 and 
P_R71360. 

35 



451 



WO 99/63088 



PCT/US99/12252 



EXAMPLE 93 : Isolation of cDNA clones Encoding Human PRO 11 05 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
5 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al. , Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56430. 

10 In light of an observed sequence homology between the DNA56430 sequence and an EST sequence 

encompassed within the Incyte EST clone no. 1853047, the Incyte EST clone 1853047 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 209 and is herein designated as DNA59612-1466. 

The entire nucleotide sequence of DNA59612-1466 is shown in Figure 209 (SEQ ID NO:292). Clone 

15 DNA59612-1466 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 28-30 and ending at the stop codon at nucleotide positions 568-570 of SEQ ID NO:292 (Figure 209). 
The predicted polypeptide precursor is 180 amino acids long (Figure 210). The full-length PROI105 protein 
shown in Figure 210 has an estimated molecular weight of about 20,040 daltons and a pi of about 8.35. Clone 
DNA59612-1466 has been deposited with the ATCC on June 9, 1998. It is understood that the deposited clone 

20 has the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

Analyzing Figure 210, a signal peptide is at about amino acids 1-19 of SEQ ID NO:293 and 
transmembrane domains are shown at about amino acids 80-99 and 145-162 of SEQ ID NO:293. It is 
understood that the skilled artisan could form a polypeptide with all of or any combination or individual selection 
25 of these regions. It is also understood that the corresponding nucleic acids can be routinely identified and 
prepared based on the information provided herein. 

EXAMPLE 94 : Isolation of cDNA clones Encoding Human PRQ51 1 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

30 cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmologv 266:460>480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 

35 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56434. 
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In light of an observed sequence homology between the DNA56434 sequence and an EST sequence 
encompassed within the Incyte EST clone no. 1227491, the Incyte EST clone 1227491 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 21 1 and is herein designated as DNA59613-1417. 

The entire nucleotide sequence of DNA59613-14 17 is shown in Figure 21 1 (SEQ ID NO:294). Clone 
5 DNA59613-1417 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 233*235 and ending at the stop codon at nucleotide positions 944-946 (Figure 211). The predicted 
polypeptide precursor is 237 amino acids long (Figure 212). The full-length PR0511 protein shown in Figure 
212 has an estimated molecular weight of about 25,284 daltons and a pi of about 5.74. Clone DNA59613-1417 
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 
10 the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Analyzing the amino acid sequence of SEQ ID NO:295, the putative signal peptide is at about amino 
acids 1-25 of SEQIDNO:295. The N-glycosylation sites are at about amino acids 45-48, 73-76, 107-110, 118- 
121, 132-135, 172-175, 175-178 and 185-188 of SEQ ID NO:295. An arthropod defensins conserved region 
is at about amino acids 176-182 of SEQ ID NO:295. A kringle domain begins at about amino acid 128 of SEQ 
15 ID NO:295 and a ly-6/u-PAR domain begins at about amino acid 6 of SEQ ID NO:295. The corresponding 
nucleotides of these amino acid sequences and others can be routinely determined given the sequences provided 
herein. 

The designations appearing in a Dayhoff database with which PR05 1 1 has some sequence identity are 
as follows: SSC20F10J; SF041083; P_W26579; S44208; JC2394; PSTAJMCDI; A27020; S59310; 
20 RAG1_RABIT; and MUSBALBC1J . 

EXAMPLE 95 : Isolation of cDNA clones Encoding Human PROl 104 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 

25 expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 

30 the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56446. 

In light of an observed sequence homology between the DNA56446 sequence and an EST sequence 
encompassed within the Incyte EST clone no. 2837496, the Incyte EST clone 2837496 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 

35 sequence of this cDNA insert is shown in Figure 213 and is herein designated as DNA59616-1465. 

The entire nucleotide sequence of DNA59616-1465 is shown in Figure 213 (SEQ ID NO:296). Clone 
DNA59616-1465 contains a single open reading frame with an apparent translational initiation site at nucleotide 
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positions 109-1 11 and ending at the stop codon at nucleotide positions 1 132-1 134 of SEQ ID NO:296 (Figure 
213). The predicted polypeptide precursor is 341 amino acids long (Figure 214). The full-length PRO1104 
protein shown in Figure 214 has an estimated molecular weight of about 36,769 daltons and a pi of about 9.03. 
Clone DNA59616-1465 has been deposited with ATCC on June 16, 1998. It is understood that the deposited 
clone has the actual nucleic acid sequence and that the sequences provided herein are based on known 
5 sequencing techniques . 

Analyzing Figure 214, a signal peptide is at about amino acids 1-22 of SEQ ID NO:297. N- 
myristoylation sites are at about amino acids 41-46, 110-115, 133-138, 167-172 and 179-184 of SEQ ID 
NO:297. 

10 EXAMPLE 96 : Isolation of cDNA clones Encoding Human PROl 100 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 

15 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). 

In light of an observed sequence homology between the obtained consensus sequence and an EST 

20 sequence encompassed within the Incyte EST clone no. 2305379, the Incyte EST clone 2305379 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 215 and is herein designated as DNA59619-1464. 

The entire nucleotide sequence of DNA59619-1464 is shown in Figure 215 (SEQ ID NO:298). Clone 
DNA59619-1464 contains a single open reading frame with an apparent translational initiation site at nucleotide 

25 positions 33-35 and ending at the stop codon at nucleotide positions 993-995 of SEQ ID NO:298 (Figure 215). 
The predicted polypeptide precursor is 320 amino acids long (Figure 216). The full-length PROl 100 protein 
shown in Figure 216 has an estimated molecular weight of about 36,475 daltons and a pi of about 7.29. Clone 
DNA59619-1464 has been deposited with ATCC on July 1 , 1998. It is understood that the deposited clone has 
the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 

30 techniques. 

Upon analyzing SEQ ID NO:299, the approximate locations of the signal peptide, the transmembrane 
domains, an N-glycosylation site, an N-myristoylation site, a CUB domain and an amiloride-sensitive sodium 
channel domain are present. It is believed that PROl 100 may function as a channel. The corresponding nucleic 
acids for these amino acids and others can be routinely determined given SEQ ID NO:299.. 

35 
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EXAMPLE 97 : Isolation of cDNA clones Encoding Human PRQ836 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
5 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al. , Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained is herein designated DNA56453. 

10 In light of an observed sequence homology between the DNA56453 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 2610075, the Incyte EST clone 2610075 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 217 and is herein designated as DNA59620-1463. 

The entire nucleotide sequence of DNA59620-1463 is shown in Figure 217 (SEQ ID NO:300). Clone 

15 DNA59620-1463 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 65-67 and ending at the stop codon at nucleotide positions 1448- 1450 of SEQ ID NO:300 (Figure 217). 
The predicted polypeptide precursor is 461 amino acids long (Figure 218). The full-length PR0836 protein 
shown in Figure 218 has an estimated molecular weight of about 52,085 daltons and a pi of about 5.36. Analysis 
of the full-length PR0836 sequence shown in Figure 218 (SEQ ID NO:301) evidences the presence of the 

20 following: a signal peptide, N-glycosylation sites, N-myristoylation sites, a domain conserved in the 
YJL126w/YLR351c/yhcX family of proteins, and a region having sequence identity with SLS1. Clone 
DNA59620-1463 has been deposited with ATCC on June 16, 1998. It is understood that the deposited clone 
has the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

25 Analysis of the amino acid sequence of the full-length PR0836 polypeptide suggests that it possesses 

some sequence similarity to SLS1, thereby indicating that PR0836 may be involved in protein translocation of 
the ER. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced some 
homology between the PR0836 amino acid sequence and at least the following Dayhoff sequences, S58132, 
SPBC3B9J, S66714, CRU40057J and IMACAEEL. 

30 

EXAMPLE 98 : Isolation of cDNA clones Encoding Human PROl 141 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 11873. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
35 and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
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in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA56518. 

In light of an observed sequence homology between the DNA56518 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2679995, the Incyte EST clone 2679995 was purchased 
5 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 219 and is herein designated as DNA59625-1498. 

Clone DNA59625-1498 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 204-206 and ending at the stop codon at nucleotide positions 945-947 (Figure 219). 
The predicted polypeptide precursor is 247 amino acids long (Figure 220). The full-length PROl 141 protein 

10 shown in Figure 220 has an estimated molecular weight of about 26,840 daltons and a pi of about 8.19. Analysis 
of the full-length PROl 141 sequence shown in Figure 220 (SEQ ID NO: 303) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 19 and transmembrane domains from 
about amino acid 38 to about amino acid 57, from about amino acid 67 to about amino acid 83, from about 
amino acid 117 to about amino acid 139 and from about amino acid 153 to about amino acid 170. Clone 

15 DNA59625-1498 has been deposited with ATCC on June 16, 1998 and is assigned ATCC deposit no. 209992. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 220 (SEQ ID NO:303), evidenced significant 
homology between the PROl 141 amino acid sequence and the following Dayhoff sequences: CEVF36H2L_2, 
PCRB7PRJJ, AB000506_1, LEU95008J, MRU87980J5, YIGM_ECOLI, STU65700J, GHU62778J, 

20 CYSTSYNY3 and AF009567_1. 

EXAMPLE 99 : Isolation of cDNA clones Encoding Human PROl 132 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is designated herein as DNA35934. Based on the DNA35934 
25 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PRO 1132. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer : 5 -TCCTGTGACCACCCCTCTAACACC-3' (SEQ ID NO:310) and 
30 reverse PCR primer : 5 , -CTGGAACATCTGCTGCCCAGATTC-3' (SEQ ID NO:31 1). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
sequence which had the following nucleotide sequence: 

5'-GTCGGATGACAGCAGCAGCCGCATCATCAATGGATCCGACTGCGATATGC-3' (SEQ ID NO:312). 
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
35 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PROl 132 gene using the probe oligonucleotide and one of the PCR primers. RN A 
for construction of the cDNA libraries was isolated from human fetal kidney. 

456 



WO 99/63088 



PCT/US99/12252 



DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PROl 132 and the derived protein sequence for PROl 132. 

The entire nucleotide sequence of PROl 132 is shown in Figure 225 (SEQ ID NO:308). Clone 
DNA59767-1489 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 354-356 and a stop codon at nucleotide positions 1233-1235 (Figure 225; SEQ ID NO:308). The 
5 predicted polypeptide precursor is 293 amino acids long. The signal peptide is at about amino acids 1-22 and 
the histidine active site is at about amino acids 104-109 of SEQ ID NO:309. Clone DNA59767-1489 has been 
deposited with ATCC (having the actual sequence rather than representations based on sequencing techniques 
as presented herein) and is assigned ATCC deposit no. 203108. The ruil-Iength PROl 132 protein shown in 
Figure 226 has an estimated molecular weight of about 32,020 daltons and a pi of about 8.7. 
10 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 226 (SEQ ID NO:309), revealed sequence identity 
between the PROl 132 amino acid sequence and the following Dayhoff sequences: SSU76256 1, P W10694, 
MMAE000663_6, AF013988J, U66061J, MMAE000665_2, MMAE00066415, MMAE00066414, 
MMAE000665_4 and MMAE00066412. 

15 

EXAMPLE 100 : Isolation of cDNA clones Encoding Human NL7 (PRO 1 346) 

A single EST sequence (#1398422) was found in the LIFESEQ* database as described in Example 1 
above. This EST sequence was renamed as DNA45668. Based on the DNA45668 sequence, oligonucleotides 
were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use 
20 as probes to isolate a clone of the full-length coding sequence for NL7. 
PCR primers (forward and reverse) were synthesized: 
forward PCR primer : 5 1 -C AC ACGTCC A ACCTC A ATGGGC AG-3 ' (SEQ ID NO:315) 
reverse PCR primer : 5'-GACCAGCAGGGCCAAGGACAAGG-3' (SEQ ID NO:316) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
25 DNA45668 sequence which had the following nucleotide sequence: 
hybridization probe: 

5 , -GTTCTCTGAGATGAAGATCCGGCCGGTCCGGGAGTACCGCTTAG-3' 
(SEQ ID NO:317) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
30 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the NL7 gene using the probe oligonucleotide and one of the PCR primers. RNA for 
construction of the cDNA libraries was isolated from a human fetal kidney library (LIB227). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for NL7 
(designated herein as DNA59776-1600 [Figure 227, SEQ ID NO:3 13]) and the derived protein sequence for NL7 
35 (PRO 1346). 

The entire coding sequence of NL7 (PR01346) is shown in Figure 227 (SEQ ID NO:313). Clone 
DNA59776-1600 contains a single open reading frame with an apparent translational initiation site at nucleotide 
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positions 1-3 and an apparent stop codon at nucleotide positions 1 384-1386. The predicted polypeptide precursor 
is 461 amino acids long. The protein contains an apparent type II transmembrane domain at amino acid positions 
from about 31 to about 50; fibrinogen beta and gamma chains C-terminal domain signature starting at about 
amino acid position 409, and a leucine zipper pattern starting at about amino acid positions 140, 147, 154 and 
161, respectively. Clone DNA59776-1600 has been deposited with ATCC and is assigned ATCC deposit no. 
5 203128. The full-length NL7 protein shown in Figure 228 has an estimated molecular weight of about 50,744 
daltons and a pi of about 6.38. 

Based on a WU-BLAST2 sequence alignment analysis (using the WU-BLAST2 computer program) of 
the full-length sequence, NL7 shows significant amino acid sequence identity to a human microfibril-associated 
glycoprotein (1 MFA4_HUMAN); to known TIE-2 ligands and ligand homologues, ficolin, serum lectin and 
10 TGF-1 binding protein. 

EXAMPLE 101: Isolation of cDNA clones Encoding Human PROl 131 

A cDNA sequence isolated in the amylase screen described in Example 2 above is herein designated 
DNA43546 (see Figure 231; SEQ ID NO:320). The DNA43546 sequence was then compared to a variety of 

15 expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into consensus DNA sequences with 

20 the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA45627. 

Based on the DNA45627 sequence, oligonucleotide probes were generated and used to screen a human 
library prepared as described in paragraph 1 of Example 2 above. The cloning vector was pRK5B (pRK5B is 
a precursor of pRK5D that does not contain the Sfil site; see, Holmes et ah, Science 253: 1278-1280 (1991)), 

25 and the cDNA size cut was less than 2800 bp. 

PCR primers (forward and 2 reverse) were synthesized: 
forward PCR primer 5 1 -ATGC AGGCC AAGT AC AGCAGC AC-3 ' (SEQ ID NO:321); 
reverse PCR primer 1 5 , -CATGCTGACGACTTCCTGCAAGC-3 ' (SEQ ID NO:322); and 
reverse PCR primer 1 5 ' -CC AC AC AGTCTCTGCTTCTTGGG-3 ' (SEQ ID NO:323) 

30 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA45627 

sequence which had the following nucleotide sequence: 
hybridization probe 

5 '-ATGCTGGATGATGATGGGGACACC ACCATG AGCCTGC ATT-3 ' (SEQ ID NO:324). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
35 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PROl 131 gene using the probe oligonucleotide and one of the PCR primers. 
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A full length clone was identified that contained a single open reading frame with an apparent 
translation^ initiation site at nucleotide positions 144-146, and a stop signal at nucleotide positions 984-986 
(Figure 229; SEQ ID NO:318). The predicted polypeptide precursor is 280 amino acids long, has a calculated 
molecular weight of approximately 31,966 daltons and an estimated pi of approximately 6.26. The 
transmembrane domain sequence is at about 49-74 of SEQ ID NO:319 and the region having sequence identity 
5 with LDL receptors is about 50-265 of SEQ ID NO:319. PROl 131 contains potential N-linked glycosylation 
sites at amino acid positions 95-98 and 169-172 of SEQ ID NO:3 19. Clone DNA59777-1480 has been deposited 
with the ATCC and is assigned ATCC deposit no. 203111. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown inFigure 230 (SEQ ID NO:319), evidenced some sequence 
10 identity between the PROl 13 1 amino acid sequence and the following Dayhoff sequences: AB0107101 , 149053, 
149115, RNU56863J, LY4A_MOUSE, 155686, MMU56404J, 149361, AF030313J and MMU09739J. 

EXAMPLE 102 : Isolation of cDNA clones Encoding Human PRO 1281 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
15 in Example 1 above. This consensus sequence is designated herein as DNA35720. Based on the DNA35720 

sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence 

of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO 1281 . 
PCR primers (forward and reverse) were synthesized: 

forward PCR primers : 
20 5*-TGGAAGGCTGCCGCAACGACAATC-3 1 (SEQ ID NO:327); 

5 ' -CTG ATGTGGCCG ATGTTCTG-3 ' (SEQ ID NO:328); and 

5 ' -ATGGCTC AGTGTGC AGAC AG-3 ' (SEQ ID NO:329). 

reverse PCR primers : 

5 '-GCATGCTGCTCCGTGAAGTAGTCC-3 ' (SEQ ID NO:330); and 
25 5'-ATGCATGGGAAAGAAGGCCTGCCC-3' (SEQ ID NO:331). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA35720 sequence 
which had the following nucleotide sequence: 
hybridization probe: 

S'-TGCACTGGTGACCACGAGGGGGTGCACTATAGCCATCTGGAGCTGAG-S' (SEQ ID NO:332). 
30 In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pairs identified above. A positive library was then used 

to isolate clones encoding the PR01281 gene using the probe oligonucleotide and one of the PCR primers. RNA 

for construction of the cDNA libraries was isolated human fetal liver. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
35 PR01281 (designated herein as DNA59820-1549 [Figure 232, SEQ ID N0:325]; and the derived protein 

sequence for PR01281. 



459 



WO 99/63088 



PCT/US99/12252 



The entire coding sequence of PR01281 is shown in Figure 232 (SEQ ID NO:325). Clone DNA59820- 
1549 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 
228-230 and an apparent stop codon at nucleotide positions 2553-2555. The predicted polypeptide precursor 
is 775 amino acids long. The full-length PR01281 protein shown in Figure 233 has an estimated molecular 
weight of about 85,481 daltons and a pi of about 6.92. Additional features include a signal peptide at about 
5 amino acids 1-15; and potential N-glycosylation sites at about amino acids 138-141 and 361-364. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 233 (SEQ ID NO:326), revealed some sequence 
identity between the PR01281 amino acid sequence and the following Dayhoff sequences: S44860, CET24D1J , 
CEC38H2J, CAC2_HAECO, B3A2_HUMAN, S22373, CEF38A3_2, CEC34F6J, CEC34F63, and 
10 CELT22B11J. 

Clone DNA59820-1549 has been deposited with ATCC and is assigned ATCC deposit no. 203129. 

EXAMPLE 103: Isolation of cDNA clones Encoding Human PRQ1Q64 

A cDNA sequence isolated in the amylase screen described in Example 2 above was found, by the WU- 

15 BLAST2 sequence alignment computer program, to have no significant sequence identity to any known human 
protein. This cDNA sequence is herein designated DNA45288. The DNA45288 sequence was then compared 
to various EST databases including public EST databases (e.g., GenBank), and a proprietary EST database 
(LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify homologous EST sequences. The comparison 
was performed using the computer program BLAST or BLAST2 (Altschul et al., Methods in Enzymology, 

20 266:460-480 (1996)]. Those comparisons resulting in a BLAST score of 70 (or in some cases, 90) or greater 
that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the 
program "phrap" (Phil Green, University of Washington, Seattle, Washington). This consensus sequence is 
herein designated DNA48609. Oligonucleotide primers based upon the DNA48609 sequence were then 
synthesized and employed to screen a human fetal kidney cDNA library which resulted in the identification of 

25 the DNA59827-1426 clone shown in Figure 234. The cloning vector was pRK5B (pRK5B is a precursor of 
pRK5D that does not contain the Sfil site; see, Holmes et al., Science . 253:1278-1280 (1991)), and the cDNA 
size cut was less than 2800 bp. 

The oligonucleotide probes employed were as follows: 
forward PCR primer 5'-CTGAGACCCTGCAGCACCATCTG-3' (SEQ ID NO:336) 

30 reverse PCR primer 5 ' -GGTGCTTCTTG AGCCCC ACTT AGC-3 * (SEQ ID NO:337) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
DNA48609 sequence which had the following nucleotide sequence 
hybridization probe 

5'-AATCTAGCTTCTCCAGGACTGTGGTCGCCCCGTCCGCTGT-3' (SEQ ID NO:338) 
35 A full length clone was identified that contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 532-534 and a stop signal at nucleotide positions 991-993 
(Figure 234, SEQ ID NO:333). The predicted polypeptide precursor is 153 amino acids long, has a calculated 
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molecular weight of approximately 17,3 17 daltons and an estimated pi of approximately 5. 17. Analysis of the 
full-length PRO1064 sequence shown in Figure 235 (SEQ ID NO:334) evidences the presence of the following: 
a signal peptide from about amino acid i to about amino acid 24, a transmembrane domain from about amino 
acid 89 to about amino acid 1 10, an indole-3-gIycerol phosphate synthase homology block from about amino acid 
74 to about amino acid 105 and a Myb DNA binding domain protein repeat protein homology block from about 
5 amino acid 1 14 to about amino acid 137. Clone DNA59827-1426 has been deposited with ATCC on August 4, 
1998 and is assigned ATCC deposit no. 203089. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 235 (SEQ ID NO:334), evidenced homology 
between the PRO1064 amino acid sequence and the following Dayhoff sequences: MMNP15PRO_l, 
10 BP187PLYHJ , CELF42G8_4,MMU58888_1,GEN14270, TUB8_SOLTU, RCN _MOUSE,HUMRBSY79 J , 
SESENODAJ and A21467J. 

EXAMPLE 104 : Isolation of cDNA clones Encoding Human PRO 1379 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
15 in Example 1 above. This consensus sequence is designated herein DNA45232. Based on the DNA45232 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PR01379. 

PCR primers (forward and reverse) were synthesized: 
20 forward PCR primer 5 ' -TGG AC ACCGT ACCCTGGT ATCTGC -3 ' (SEQ ID NO:341) 
reverse PCR primer 5 '-CCAACTCTGAGGAGAGCAAGTGGC-3 ' (SEQ ID NO:342) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
DNA45232 sequence which had the following nucleotide sequence: 
hybridization probe 

25 5'-TGTATGTGCACACCCTCACCATCACCTCCAAGGGCAAGGAGAAC-3* (SEQ ID NO:343). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 

isolate clones encoding the PR01379 gene using the probe oligonucleotide and one of the PCR primers. RNA 

for construction of the cDNA libraries was isolated human fetal kidney tissue. 
30 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PR01379 which is designated herein as DNA59828-1608 and shown in Figure 237 (SEQ ID NO:339); and the 

derived protein sequence for PR01379 (SEQ ID NO:340). 

The entire coding sequence of PR01379 is shown in Figure 237 (SEQ ID NO:339). Clone DNA59828- 

1608 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 
35 10-12 and an apparent stop codon at nucleotide positions 1732-1734. The predicted polypeptide precursor is 574 

amino acids long. The full-length PRO 1379 protein shown in Figure 238 has an estimated molecular weight of 

about 65,355 daltons and a pi of about 8.73. Additional features include a signal peptide at about amino acids 
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1-17 and potential N-glycosylation sites at about amino acids 160-163, 287-290, and 323-326. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 238 (SEQ ID NO:340), revealed some homology 
between the PRO 1379 amino acid sequence and the following Dayhoff sequences: YHY8_YEAST, AF040625_1 , 
HP714394J, and HIV18U45630J. 
5 Clone DNA59828-1608 has been deposited with ATCC and is assigned ATCC deposit no. 203158. 

EXAMPLE 105: Isolation of cDNA Clones Encoding Human PRQ844 

An expressed sequence tag (EST) DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, C A) 
was searched and an EST was identified which showed sequence identity with aLP. Based on the information 
10 and discoveries provided herein, the clone for this EST, Incyte clone no. 2657496 from a cancerous lung library 
was further examined. 

DNA sequencing of the insert for this clone gave a sequence (herein designated as DNA59838-1462; 
SEQ ID NO:344) which includes the full-length DNA sequence for PR0844 and the derived protein sequence 
for PR0844. 

15 The entire nucleotide sequence of DNA59838-1462 is shown in Figure 239 (SEQ ID NO:344). Clone 

DNA59838-1462 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 5-7 and ending at the stop codon at nucleotide positions 338-340 of SEQ ID NO:344 (Figure 239). 
The predicted polypeptide precursor is 111 amino acids long (Figure 240). The full-length PR0844 protein 
shown in Figure 240 has an estimated molecular weight of about 12,050 dahons and a pi of about 5.45. Clone 

20 UNQ544 DNA59838-1462 has been deposited with ATCC on June 16, 1998. It is understood that the deposited 
clone has the actual nucleic acid sequence and that the sequences provided herein are based on known 
sequencing techniques. 

Analysis of the amino acid sequence of the full-length PR0844 polypeptide suggests that it possesses 
significant sequence similarity to serine protease inhibitors, thereby indicating that PR0844 may be a novel 
25 proteinase inhibitor. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
evidenced significant homology between the PR0844 amino acid sequence and at least the following Dayhoff 
sequences, ALK1_HUMAN, PJ>82403, P P82402, ELAF HUMAN and P_P60950. 

EXAMPLE 106: Isolation of cDNA Clones Encoding Human PRQ848 

30 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al. , Methods in 

35 EnzvmologY 266:460480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
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obtained therefrom is herein designated DNA55999. 

In light of an observed sequence homology between the DNA55999 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2768571, the Incyte EST clone 2768571 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 241 and is herein designated as DNA59839-1461. 
5 The entire nucleotide sequence of DNA59839-1461 is shown in Figure 241 (SEQ ID NO:346). Clone 

DNA59839-1461 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 146-148 and ending at the stop codon at nucleotide positions 1946-1948 of SEQ ID NO:346 (Figure 
241). The predicted polypeptide precursor is 600 amino acids long (Figure 242). The full-length PR0848 
protein shown in Figure 242 has an estimated molecular weight of about 68,536 daltons. Clone DNA59839- 1461 

10 has been deposited with ATCC on June 16, 1998. It is understood that the deposited clone has the actual nucleic 
acid sequence and that the sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PR0848 polypeptide suggests that it may be a 
novel sialyltransferase. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
evidenced sequence identity between the PR0848 amino acid sequence and at least the following Dayhoff 

15 sequences, P R78619 (GalNAc-alpha-2, 6-sialyltransferase), CAAG5_CHICK (alpha-n-acetylgalactosamide 
alpha-2,6-sialytransferase),HSU14550_l, CAG6 HUMAN and PR63217 (human alpha-2, 3-sialyltransferase). 

EXAMPLE 107: Isolation of cDNA Clones Enco ding Human PRO 1097 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

20 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 

25 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56006. 

In light of an observed sequence homology between the DNA56006 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2408105, the Incyte EST clone 2408105 was purchased 

30 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 243 and is herein designated as DNA5984 1-1460. 

The entire nucleotide sequence of DNA59841-1460 is shown in Figure 243 (SEQ ID NO:348). Clone 
DNA5984 1-1460 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 3-5 and ending at the stop codon at nucleotide positions 276-278 of SEQ ID NO:348 (Figure 243). 

35 The predicted polypeptide precursor is 91 amino acids long (Figure 244). The full-length PRO1097 protein 
shown in Figure 244 has an estimated molecular weight of about 10,542 daltons and a pi of about 10.04. Clone 
DNA59841-1460 has been deposited with ATCC on July 1, 1998. It is understood that the deposited clone has 
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the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

Analyzing Figure 244, the signal peptide is at about amino acids 1-20 of SEQ ID NO:349. The 
glycoprotease family protein domain starts at about amino acid 56, and the acyltransferase ChoActase/COT/CPT 
family peptide starts at about amino acid 49 of SEQ ID NO:349. 

5 

EXAMPLE 108:Isolation of cDNA clones Encoding Human PRQ1153 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 

10 EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 

15 obtained therefrom is herein designated DNA56008. 

In light of an observed sequence homology between the DNA56008 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2472409, the Incyte EST clone 2472409 was purchased 
and the cDNA insert was obtained and sequenced. It was found thai this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 245 and is herein designated as DNA59842-1502. 

20 The full length clone shown in Figure 245 contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 92-94 and ending at the stop codon found at nucleotide positions 
683-685 (Figure 245; SEQ ID NO:350). The predicted polypeptide precursor (Figure 246, SEQ ID NO:351) 
is 197 amino acids long. PROl 153 has a calculated molecular weight of approximately 21 ,540 daltons and an 
estimated pi of approximately 8.31. Clone DNA59842-1502 has been deposited with ATCC and is assigned 

25 ATCC deposit no. 209982. It is understood that the correct and actual sequence is in the deposited clone while 
herein are present representations based on current sequencing techniques which may have minor errors. 

Based on a WU-BLAST2 sequence alignment analysis (using the ALIGN computer program) of the full- 
length sequence, PROl 153 shows some amino acid sequence identity to the following Dayhoff designations: 
S57447; SOYHRGPCJ; S46965; PJ>82971; VCPHEROPH 1; EXTNTOBAC; MLCB2548_9; 

30 ANXA_RABIT; JC5437 and SSGP_VOLCA. 

EXAMPLE 109: Isolation of cDNA clones Encoding Hu man PROl 154 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
35 expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
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Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DN A sequence with 
the program tt phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56025. 

In light of an observed sequence homology between the DNA56025 consensus sequence and an EST 
5 sequence encompassed within the Incyte EST clone no. 2169375, the Incyte EST clone 2169375 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 247 and is herein designated as DNA59846-1503. 

The full length clone shown in Figure 247 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 86-88 and ending at the stop codon found at nucleotide positions 
10 2909-291 1 (Figure 247; SEQ ID NO:352). The predicted polypeptide precursor (Figure 248, SEQ ID NO:353) 
is 941 amino acids long. PRO 1 154 has a calculated molecular weight of approximately 107,144 daltons and an 
estimated pi of approximately 6.26. Clone DNA59846-1503 has been deposited with ATCC and is assigned 
ATCC deposit no. 209978. 

Based on a WU-BLAST2 sequence alignment analysis (using the ALIGN computer program) of the full- 
15 length sequence, PROl 154 shows sequence identity to at least the following Dayhoff designations: AB01 1097 1 , 
AMPN_HUMAN, RNU76997J, 159331, GEN14047, HSU62768J ,P_R51281,CET07F10J, SSU66371J, 
and AMPRE_HUMAN. 

EXAMPLE 110 : Isolation of cDNA clones Encoding Human PROl 181 

20 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

EST cluster sequence from the Incyte database, designated herein as 82468. This EST cluster sequence was then 
compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., 
GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify 
existing homologies. The homology search was performed using the computer program BLAST or BLAST2 

25 (Altshul et al. , Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score 
of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a 
consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, 
Washington). The consensus sequence obtained therefrom is herein designated DNA56029. 

In light of an observed sequence homology between the DNA56029 consensus sequence and an EST 

30 sequence encompassed within the Incyte EST clone no. 2186536, the Incyte EST clone 2186536 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 249 and is herein designated as DNA59847-1511. 

Clone DNA59847-1511 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 17-19 and ending at the stop codon at nucleotide positions 1328-1330 (Figure 249). 

35 The predicted polypeptide precursor is 437 amino acids long (Figure 250). The full-length PROl 181 protein 
shown in Figure 250 has an estimated molecular weight of about 46,363 daltons and a pi of about 6.22. Analysis 
of the full-length PROl 181 sequence shown in Figure 250 (SEQ ID NO:355) evidences the presence of the 
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following: a signal peptide from about amino acid 1 to about amino acid 15, potential N-glycosylation sites from 
about amino acid 46 to about amino acid 49, from about amino acid 189 to about amino acid 192 and from about 
amino acid 382 to about amino acid 385 and amino acid sequence blocks having homology to Ly-6/u-PAR 
domain proteins from about amino acid 287 to about amino acid 300 and from about amino acid 98 to about 
amino acid 111. Clone DNA59847-1511 has been deposited with ATCC on August 4, 1998 and is assigned 
5 ATCC deposit no. 203098. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 250 (SEQ ID NO:355), evidenced homology 
between the PRO 1181 amino acid sequence and the following Dayhoff sequences: AF041083_1, PW26579, 
RNMAGPIANJ, CELT13C2_2, LMSAP2GNJ, S61882, CEF35C5J2, DP87_DICDI, GIU47631J and 
10 P_R07092. 

EXAMPLE 111: Isolation of cDNA clones Encoding Human PRO 1 182 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database, designated herein as 146647. This EST cluster sequence was 

15 then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ® Incyte Pharmaceuticals, Palo Alto, CA) to 
identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al., Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 

20 assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56033, 

In light of an observed sequence homology between the DNA56033 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2595195, the Incyte EST clone 2595195 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

25 The sequence of this cDNA insert is shown in Figure 251 and is herein designated as DNA59848-1512. 

,. Clone DNA59848-1512 contains a single open reading frame with an apparent translational initiation 

site at nucleotide positions 67-69 and ending at the stop codon at nucleotide positions 880-882 (Figure 251). The 
predicted polypeptide precursor is 271 amino acids long (Figure 252). The full-length PROl 182 protein shown 
in Figure 252 has an estimated molecular weight of about 28,665 daltons and a pi of about 5.33. Analysis of 

30 the full-length PROl 182 sequence shown in Figure 252 (SEQ ID NO:357) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 25, an amino acid block having 
homology to C-type lectin domain proteins from about amino acid 247 to about amino acid 256 and an amino 
acid sequence block having homology to Clq domain proteins from about amino acid 44 to about amino acid 
77. Clone DNA59848-1512 has been deposited with ATCC on August 4, 1998 and is assigned ATCC deposit 

35 no. 203088. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 252 (SEQ ID NO:357), evidenced significant 
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homology between the PRO 11 82 amino acid sequence and the following Dayhoff sequences: PSPD BOVIN, 
CL43_BOVIN, CONG_BOVIN, PJV18780, P_R45005, P_R53257 and CELEGAP7J. 

EXAMPLE 112: Isolation of cDNA clones Encoding Human PRQ1155 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
5 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 

expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 

EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 

homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 

Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
10 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 

the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 

obtained therefrom is herein designated DNA56102. 

In light of an observed sequence homology between the DNA56102 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 2858870, the Incyte EST clone 2858870 was purchased 
15 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

The sequence of this cDNA insert is shown in Figure 253 and is herein designated as DNA59849-1504. 

The full length clone shown in Figure 253 contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 158-160 and ending at the stop codon found at nucleotide 

positions 563-565 (Figure 253; SEQ ID NO:358). The predicted polypeptide precursor (Figure 254, SEQ ID 
20 NO:359) is 135 amino acids long. PR01155 has a calculated molecular weight of approximately 14,833 daltons 

and an estimated pi of approximately 9.78. Clone DNA59849-1504 has been deposited with ATCC and is 

assigned ATCC deposit no. 209986. It is understood that the actual clone has the correct sequence whereas 

herein are only representations which are prone to minor sequencing errors. 

Based on a WU-BLAST2 sequence alignment analysis (using the ALIGN computer program) of the full- 
25 length sequence, PR01155 shows some amino acid sequence identity with the following Dayhoff designations: 

TKNK_BOVIN; PVB19X587J; AF019049J; P_W00948; S72864; P__W00949; 162742; AF038501 1; 

TKNG_HUMAN; and YATl_RHOBL. Based on the information provided herein, PROH55 may play a role 

in providing neuroprotection and cognitive enhancement. 

30 EXAMPLE 113 : Isolation of cDNA clones Encoding Human PROl 156 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database, designated herein as 138851. This EST cluster sequence was 
then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to 

35 identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al,, Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
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assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56261. 

In light of an observed sequence homology between the DNA56261 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3675191, the Incyte EST clone 3675191 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
5 The sequence of this cDNA insert is shown in Figure 255 and is herein designated as DNA59853-1505. 

The full length clone shown in Figure 255 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 212-214 and ending at the stop codon found at nucleotide 
positions 689-691 (Figure 255; SEQ ID NO:360). The predicted polypeptide precursor (Figure 256, SEQ ID 
NO:361)is 159 amino acids long. PRO 11 56 has a calculated molecular weight of approximately 17,476 daltons, 
10 an estimated pi of approximately 9.15, a signal peptide sequence at about amino acids 1 to about 22, and 
potential N-glycosylation sites at about amino acids 27-30 and 41-44. 

Clone DNA59853-1505 was deposited with the ATCC on June 16, 1998 and is assigned ATCC deposit 
no. 209985. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
15 alignment analysis (using the ALIGN computer program) of the full-length sequence shown in Figure 256 (SEQ 
ID NO:361), revealed some homology between the PROl 156 amino acid sequence and the following Dayhoff 
sequences: D45027J, P_R79914, JC5309, KBF2HUMAN, AF010144J, GEN14351, S68681, P_R79915, 
ZMTACJ, and HUMCPGOJ. 

20 EXAMPLE 114: Isolation of cDNA Clones Encoding Human PRO1098 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 

25 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56377. 

30 In light of an observed sequence homology between the DNA56377 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3050917, the Incyte EST clone 3050917 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 257 and is herein designated as DNA59854-1459. 

The entire nucleotide sequence of DNA59854-1459 is shown in Figure 257 (SEQ ID NO:362). Clone 

35 DNA59854-1459 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 58-60 and ending at the stop codon at nucleotide positions 292-294 of SEQ ID NO:362 (Figure 257). 
The predicted polypeptide precursor is 78 amino acids long (Figure 258). The full-length PRO1098 protein 
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shown in Figure 258 has an estimated molecular weight of about 8,396 daltons and a pi of about 7.66. Clone 
DNA59854-1459 has been deposited with ATCC on June 16, 1998. It is understood that the deposited clone 
has the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

Analyzing Figure 258, a signal peptide appears to be at about amino acids 1-19 of SEQ ID NO:363, 
5 an N-glycosylation site appears to be at about amino acids 37-40 of SEQ ID NO:363, and N-myristoylauon sites 
appear to be at about 15-20, 19-24 and 60-65 of SEQ ID NO:363. 

EXAMPLE 115 : Isolation of cDNA clones Encoding Human PROl 127 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

10 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 

15 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57959. 

In light of an observed sequence homology between the DNA57959 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. 685126, the Merck EST clone 685126 was purchased 

20 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 259 and is herein designated as DNA60283-1484. 

The full length clone shown in Figure 259 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 126-128 and ending at the stop codon found at nucleotide 
positions 327-329 (Figure 259; SEQ ID NO:364). The predicted polypeptide precursor (Figure 260, SEQ ID 

25 NO:365) is 67 amino acids long including a signal peptide at about 1-29 of SEQ ID NO:365. PROl 127 has a 
calculated molecular weight of approximately 7,528 daltons and an estimated pi of approximately 4.95. Clone 
DNA60283-1484 was deposited with the ATCC on July 1, 1998 and is assigned ATCC deposit no. 203043. 
It is understood that the deposited clone has the actual sequence, whereas representations which may have minor 
sequencing errors are presented herein. 

30 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 260 (SEQ ID NO:365), revealed some homology 
between the PROH27 amino acid sequence and the following Dayhoff sequences: AF037218 48, PW09638, 
HBA_HETPO, S39821, KR2_EBV, CET20D3J, HCU37630J, HS193B12J0, S40012 and TRITUBCJ. 

35 
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EXAMPLE 116: Isolation of cDNA clones Encoding Hu man PRO 1 126 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ® Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program u phrap w (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56250. 

In light of an observed sequence homology between the DNA56250 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 1437250, the Incyte EST clone 1437250 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 261 and is herein designated as DNA60615-1483. 

Clone DNA60615-1483 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 1 10-1 12 and ending at the stop codon at nucleotide positions 1316-13 1 8 (Figure 261). 
The predicted polypeptide precursor is 402 amino acids long (Figure 262). The full-length PROl 126 protein 
shown in Figure 262 has an estimated molecular weight of about 45,92 1 daltons and a pi of about 8.60. Analysis 
of the full-length PROl 126 sequence shown in Figure 262 (SEQ ID NO:367) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 25 and potential N-glycosylation sites 
from about amino acid 66 to about amino acid 69, from about amino acid 1 38 to about amino acid 141 and from 
about amino acid 183 to about amino acid 186. Clone DNA60615-1483 has been deposited with ATCC on June 
16, 1998 and is assigned ATCC deposit no. 209980. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 262 (SEQ ID NO:367), evidenced significant 
homology between the PROl 126 amino acid sequence and the following Dayhoff sequences: 173636, 
NOMR_HUMAN, MMUSMYOC3J, HS454G6J, P_R98225, RNU78105J, RNU72487J, AF035301J, 
CEELC48E7_4 and CEF1 1C3_3. 

EXAMPLE 117 : Isolation of cDNA clones Encoding Human PROl 125 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
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obtained therefrom is herein designated DNA56540. 

In light of an observed sequence homology between the DNA56540 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 14861 14, the Incyte EST clone 14861 14 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 263 and is herein designated as DNA60615-1483. 
5 The full length clone shown in Figure 263 contained a single open reading frame with an apparent 

translation^ initiation site at nucleotide positions 47-49 and ending at the stop codon found at nucleotide positions 
1388-1390 (Figure 263; SEQ ID NO:368). The predicted polypeptide precursor (Figure 264, SEQ ID NO:369) 
is 447 amino acids long. PR01125 has a calculated molecular weight of approximately 49,798 daltons and an 
estimated pi of approximately 9.78. Clone DNA60619-1482 has been deposited with ATCC and is assigned 

10 ATCC deposit no. 209993. It is understood that the clone has the actual sequence and that the sequences herein 
are representations based on current techniques which may be prone to minor errors. 

Based on a WU-BLAST2 sequence alignment analysis (using the ALIGN computer program) of the full- 
length sequence, PRO 1125 shows some sequence identity with the following Dayhoff designations: 
RC01_NEUCR; S58306; PKWAJTHECU; S76086; P_R8588L; HET1PODAN; SPU92792J; 

15 APAF_HUMAN; S76414 and S59317, 

EXAMPLE 118 : Isolation of cDNA clones Encoding Human PRO 11 86 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 

20 expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 

25 the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56748. 

In light of an observed sequence homology between the DNA56748 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3476792, the Incyte EST clone 3476792 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

30 The sequence of this cDNA insert is shown in Figure 265 and is herein designated as DNA60621-1516. 

The full length clone shown in Figure 265 contained a single open reading frame with an apparent 
translation^ initiation site at nucleotide positions 9 1 -93 and ending at the stop codon found at nucleotide positions 
406-408 (Figure 265; SEQ ID NO:370). The predicted polypeptide precursor (Figure 266, SEQ ID NO:371) 
is 105 amino acids long. The signal peptide is at amino acids 1-19 of SEQ ID NO:371. PROH86 has a 

35 calculated molecular weight of approximately 1 1,715 daltons and an estimated pi of approximately 9.05. Clone 
DNA60621-1516 was deposited with the ATCC on August 4, 1998 and is assigned ATCC deposit no. 203091. 
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An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 266 (SEQ ID NO:371), revealed some sequence 
identity between the PR01186 amino acid sequence and the following Dayhoff sequences: VPRA_DENPO, 
LFE4_CHICK, AF034208J, AF030433J, A55035, COL_RABIT, CELB0507_9, S67826J, S34665 and 
CRU73817J. 

5 

EXAMPLE 119 : Isolation of cDNA clones Encoding Human PRO 1 198 

An initial DNA sequence referred to herein as DNA52083 was identified using a yeast screen in a 
human umbilical vein endothelial cell cDNA library that preferentially represents the 5* ends of the primary 
cDNA clones. DNA52083 was compared to ESTs from public databases (e.g., GenBank), and a proprietary 

10 EST database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA), using the computer program BLAST or 
BLAST2 [Altschul et al., Methods in Enzvmologv. 266:460^80 (1996)]. The ESTs were clustered and 
assembled into a consensus DNA sequence using the computer program "phrap" (Phil Green, University of 
Washington, Seattle, Washington). One or more of the ESTs was obtained from human breast skin tissue 
biopsy. This consensus sequence is designated herein as DNA52780. 

15 In light of an observed sequence homology between the DNA52780 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3852910, the Incyte EST clone 3852910 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 267 and is herein designated as DNA60622-1525. 

The full length DNA60622-1525 clone shown in Figure 267 (SEQ ID NO:372) contained a single open 

20 reading frame with an apparent translational initiation site at nucleotide positions 54 to 56 and ending at the stop 
codon found at nucleotide positions 741 to 743. The predicted polypeptide precursor, which is shown in Figure 
268 (SEQ ID NO:373), is 229 amino acids long. PRO 1 198 has a calculated molecular weight of approximately 
25,764 daltons and an estimated pi of approximately 9.17. There is a signal peptide sequence at about amino 
acids 1 through 34. There is sequence identity with glycosyl hydrolases family 31 protein at about amino acids 

25 142 to about 175. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 268 (SEQ ID NO: 373), revealed some homology 
between the PR01198 amino acid sequence and the following Dayhoff sequences: ATF6H116, UCRIRAT, 
TOBSUP2NTJ, RCUERF3J, AMU88186J, P_W22485, S56579, AF040711J, DPP4 PIG. 
30 Clone DNA60622-1525 was been deposited with the ATCC on August 4, 1998, and is assigned ATCC 

deposit no. 203090. 

EXAMPLE 120 : Isolation of cDNA clones Encoding Human PRQ1158 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
35 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
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homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57248. 
5 In light of an observed sequence homology between the DNA57248 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 2640776, the Incyte EST clone 2640776 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 269 and is herein designated as DNA60625-1507. 

The full length clone shown in Figure 269 contained a single open reading frame with an apparent 

10 translational initiation site at nucleotide positions 163 to 165 and ending at the stop codon found at nucleotide 
positions 532 to 534 (Figure 269; SEQ ID NO:374). The predicted polypeptide precursor (Figure 270, SEQ 
ID NO:375) is 123 amino acids long, PR01158 has a calculated molecular weight of approximately 13,113 
daltons and an estimated pi of approximately 8.53. Additional features include a signal peptide sequence at about 
amino acids 1-19, a transmembrane domain at about amino acids 56-80, and a potential N-glycosylation site at 

15 about amino acids 36-39. Clone DNA60625-1507 was deposited with the ATCC on June 16, 1998 and is 
assigned ATCC deposit no. 209975. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 270 (SEQ ID NO:375), revealed some homology 
between the PR01158 amino acid sequence and the following Dayhoff sequences: ATAC00310510F18A8.10, 

20 P_R8515i, PHS2_SOLTU, RNMHCIBAC1, RNA1FMHCI, 168771, RNRT1A10G1, PTPA_HUMAN, 
HUMGACAJ, and CHKPTPAJ. 

EXAMPLE 121: Isolation of cDNA clones Encoding Human PRQ1159 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

25 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al. , Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 

30 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57221. 

In light of an observed sequence homology between the DNA57221 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 376776, the Incyte EST clone 376776 was purchased 

35 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 271 and is herein designated as DNA60627-1508. 
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Clone DNA60627-1508 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 92-94 and ending at the stop codon at nucleotide positions 362-364 (Figure 271). The 
predicted polypeptide precursor is 90 amino acids long (Figure 272). The full-length PRO 1 159 protein shown 
in Figure 272 has an estimated molecular weight of about 9,840 daltons and a pi of about 10. 13. Analysis of 
the full-length PROH59 sequence shown in Figure 272 (SEQ ID NO:377) evidences the presence of the 
5 following: a signal peptide from about amino acid 1 to about amino acid 15 and a potential N-glycosylation site 
from about amino acid 38 to about amino acid 41 . Clone DNA60627-1508 has been deposited with ATCC on 
August 4, 1998 and is assigned ATCC deposit no. 203092. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 272 (SEQ ID NO:377), evidenced significant 
10 homology between the PRO 1159 amino acid sequence and the following Dayhoff sequences: AF016494_6, 
AF036708_20, DSSCUTEJ, D89100J, S28060, MEFA_XENLA t AF020798J2,G70065,E64423,JQ2005. 

EXAMPLE 122: Isolation of cDNA clones Encoding Human PRO 1124 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

15 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 

20 or greater mat did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56035. 

In light of an observed sequence homology between the DNA56035 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2767646, the Incyte EST clone 2767646 was purchased 

25 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 273 and is herein designated as DNA60629-1481. 

The full length clone shown in Figure 273 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 25-27 and ending at the stop codon found at nucleotide positions 
2782-2784 (Figure 273; SEQ ID NO:378). The predicted polypeptide precursor (Figure 274, SEQ ID NO:379) 

30 is 919 amino acids long. PR01124 has a calculated molecular weight of approximately 101,282 daltons and an 
estimated pi of approximately 5.37. Clone DNA60629-1481 has been deposited with the ATCC and is assigned 
ATCC deposit no. 209979. It is understood that the deposited clone has the actual sequence, whereas only 
representations based on current sequencing techniques which may include normal and minor errors, are 
provided herein. 

35 Based on a WU-BLAST2 sequence alignment analysis of the full-length sequence, PRO 11 24 shows 

significant amino acid sequence identity to a chloride channel protein and to ECAM-1. Specifically, the 
following Dayhoff designations were identified as having sequence identity with PROH24: ECLCBOVIN, 
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AF001261J, P_W06548, SSC6A10J, AF004355J, S76691, AF017642, BYU06866_2, CSA_DICDI and 
SAU47139 2. 

EXAMPLE 123 : Isolation of cDNA clones Encoding Human PRO 1287 

An expressed sequence tag (EST) DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) 
5 was searched and an EST was identified which showed homology to the fringe protein. This EST sequence was 
then compared to various EST databases including public EST databases (e.g. , GenBank), and a proprietary EST 
database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identity homologous EST sequences. The 
comparison was performed using the computer program BLAST or BLAST2 [Altschul et al., Methods in 
Enzvmologv. 266:460-480 (1996)]. Those comparisons resulting in a BLAST score of 70 (or in some cases, 

10 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence 
with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). This consensus 
sequence obtained is herein designated DNA40568. 

Based on the DNA40568 consensus sequence, oligonucleotides were synthesized: 1) to identify by PGR 
a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full- 

15 length coding sequence for PR01287. Forward and reverse PCR primers generally range from 20 to 30 
nucleotides and are often designed to give a PCR product of about 100-1000 bp in length. The probe sequences 
are typically 40-55 bp in length. In some cases, additional oligonucleotides are synthesized when the consensus 
sequence is greater than about 1-1 .5kbp. In order to screen several libraries for a full-length clone, DNA from 
the libraries was screened by PCR amplification, as per Ausubel et al., Current Protocols in Mole cular Biology. 

20 supra, with the PCR primer pair. A positive library was then used to isolate clones encoding the gene of interest 
using the probe oligonucleotide and one of the primer pairs. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5 ' -CTCGGGG A AAGGG ACTTG ATGTTGG-3 ' (SEQ ID NO:382) 
reverse PCR primer 1 5 ' -GCG AAGGTG AGCCTCTATCTCGTGCC-3 1 (SEQ ID NO:383) 

25 reverse PCR primer 2 5 1 -C AGCCTAC ACGT ATTGAGG-3 ' (SEQ ID NO:384) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA40568 
sequence which had the following nucleotide sequence 
hybridization probe 

5'-CAGTCAGTACAATCCTGGCATAATATACGGCCACCATGATGCAGTCCC-3* (SEQ ID NO:385). 

30 In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pairs identified above. A positive library was then used 
to isolate clones encoding the PR01287 gene using the probe oligonucleotide and one of the PCR primers. 

RNA for construction of the cDNA libraries was isolated from human bone marrow tissue. The cDNA 
libraries used to isolated the cDNA clones were constructed by standard methods using commercially available 

35 reagents such as those from Invitrogen, San Diego, C A. The cDNA was primed with oligo dT containing a NotI 
site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately by gel 
electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or pRKD; 
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pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al. , Science . 253:1278-1280 
(1991)) in the unique Xhol and NotI sites. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR01287 (designated herein as DNA61755-1554 [Figure 275, SEQ ID NO:380]) and the derived protein 
sequence for PRO 1287. 

5 The entire nucleotide sequence of DNA61755-1554 is shown in Figure 275 (SEQ ID NO:380). The 

full length clone contained a single open reading frame with an apparent translational initiation site at nucleotide 
positions 655-657 and a stop signal at nucleotide positions 2251-2253 (Figure 275, SEQ ID NO:380). The 
predicted polypeptide precursor is 532 amino acids long, has a calculated molecular weight of approximately 
61 ,35 1 daltons and an estimated pi of approximately 8.77, Analysis of the full-length PR01287 sequence shown 

10 in Figure 276 (SEQ ID NO:381) evidences the presence of the following: a signal peptide from about amino acid 
1 to about amino acid 27 and potential N-glycosylation sites from about amino acid 315 to about amino acid 318 
and from about amino acid 324 to about amino acid 327. Clone DNA6 1755- 1554 has been deposited with 
ATCC on August 11, 1998 and is assigned ATCC deposit no. 2031 12. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

15 alignment analysis of the full-length sequence shown in Figure 276 (SEQ ID NO: 381), evidenced significant 
homology between the PR01287 amino acid sequence and the following Dayhoff sequences: CET24D11, 
EZRIJBOVIN, GGU19889J, CC3JTEAST, S74244, NALSJvlOUSE, MOES_PIG, S28660, S44860 and 
YNA4CAEEL. 

20 EXAMPLE 124 : Isolation of cDNA clones Encoding Human PRO 13 12 

DNA55773 was identified in a human fetal kidney cDNA library using a yeast screen that preferentially 
represents the 5' ends of the primary cDNA clones. Based on the DNA55773 sequence, oligonucleotides were 
synthesized for use as probes to isolate a clone of the full-length coding sequence for PR01312. 

The full length DNA6 1873- 1574 clone shown in Figure 277 (SEQ ID NO: 386) contained a single open 

25 reading frame with an apparent translational initiation site at nucleotide positions 7-9 and ending at the stop 
codon found at nucleotide positions 643-645. The predicted polypeptide precursor is 212 amino acids long 
(Figure 278, SEQ ID NO:387). PR01312 has a calculated molecular weight of approximately 24,024 daltons 
and an estimated pi of approximately 6.26. Other features include a signal peptide at about amino acids 1-14; 
a transmembrane domain at about amino acids 141-160, and potential N-glycosylation sites at about amino acids 

30 76-79 and 93-96. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 278 (SEQ ID NO: 387), revealed some homology 
between the PR01312 amino acid sequence and the following Dayhoff sequences: GCINTALPHJ, 
GIBMUC1A1, P R96298, AF001406J, PVU88874J, P_R85151, AF041409J, CELC50F2J7, C45875, 
35 and AB009510_21. 

Clone DNA61873-1574 has been deposited with ATCC and is assigned ATCC deposit no. 203132. 
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EXAMPLE 125 : Isolation of cDNA clones Encoding Human PROl 192 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is designated herein DNA35924. Based on the DNA35924 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
5 PROl 192. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer : 5 * -CCGAGGCC ATCTAGAGGCC AG AGC-3 * (SEQ ID NO:390) 
reverse PCR primer : 5 ' -AC AGGC AG AGCC AATGGCC AG AGC -3 ' (SEQ ID NO:391). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
10 DNA35924 sequence which had the following nucleotide sequence: 
hybridization probe: 

5 ' -G AG AGG ACTGCGGG AGTTTGGG ACCTTTGTGC AG ACGTGCTC ATG-3 * (SEQ ID NO:392). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
15 isolate clones encoding the PROl 192 gene using the probe oligonucleotide and one of the PCR primers. RNA 

for construction of the cDNA libraries was isolated from human fetal liver and spleen tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PROl 192 designated herein as DNA62814-1521 and shown in Figure 279 (SEQ ID NO:388); and the derived 

protein sequence for PROl 192 which is shown in Figure 280 (SEQ ID NO:389). 
20 The entire coding sequence of PROH92 is shown in Figure 279 (SEQ ID NO:388). Clone DNA62814- 

1521 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 

121-123 and an apparent stop codon at nucleotide positions 766-768. The predicted polypeptide precursor is 215 

amino acids long. The predicted polypeptide precursor has the following features: a signal peptide at about 

amino acids 1-21; a transmembrane domain at about amino acids 153-176; potential N-glycosylation sites at 
25 about amino acids 39-42 and 118-121; and homology with myelin P0 proteins at about amino acids 27-68 and 

99-128 of Figure 280. The full-length PROl 192 protein shown in Figure 280 has an estimated molecular weight 

of about 24,484 daltons and a pi of about 6.98. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 280 (SEQ ID NO:389), revealed homology 
30 between the PROl 192 amino acid sequence and the following Dayhoff sequences: GEN12838, MYPOHUMAN, 

AF049498J, GEN14531, P_W14146, HS46KDAJ, CINBRAT, OX2G_RAT, D87018J, and D86996_2. 
Clone DNA62814-1521 was deposited with the ATCC on August 4, 1998, and is assigned ATCC 

deposit no. 203093. 

35 
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EXAMPLE 126 : Isolation of cDNA clones Encoding Human PRO1160 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above This consensus sequence is herein designated DNA40650. Based on the DNA40650 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
5 PROl 160. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5-GCTCCCTGATCTTCATGTCACCACC-3' (SEQ ID NO:395) 
reverse PCR primer 5 ' -C AGGG AC AC ACTCT ACC ATTCGGG AG-3 ' (SEQ ID NO:396) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA40650 
10 sequence which had the following nucleotide sequence 
hybridization probe 

5 * -CC ATCTTTCTGGTCTCTGCCC AG AATCCG AC AAC AGCTGCTC-3 ' (SEQ ID NO:397) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
15 isolate clones encoding the PROl 160 gene using the probe oligonucleotide and one of the PCR primers. RNA 

for construction of the cDNA libraries was isolated from human breast tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PROl 160 (designated herein as DNA62872-1509 [Figure 281, SEQ ID NO: 393]) and the derived protein 

sequence for PROl 160. 

20 The entire nucleotide sequence of DNA62872-1509 is shown in Figure 281 (SEQ ID NO:393). Clone 

DNA62872-1509 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 40-42 and ending at the stop codon at nucleotide positions 310-312 (Figure 281). The predicted 
polypeptide precursor is 90 amino acids long (Figure 282). The full-length PROl 160 protein shown in Figure 
282 has an estimated molecular weight of about 9,039 daltons and a pi of about 4.37. Analysis of the full-length 

25 PROl 160 sequence shown in Figure 282 (SEQ ID NO: 394) evidences the presence of the following: a signal 
peptide from about amino acid 1 to about amino acid 19 and a protein kinase C phosphorylation site from about 
amino acid 68 to about amino acid 70. Clone DNA62872-1509 has been deposited with ATCC on August 4, 
1998 and is assigned ATCC deposit no. 203100. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

30 alignment analysis of the full-length sequence shown in Figure 282 (SEQ ID NO:394), evidenced significant 
homology between the PROl 160 amino acid sequence and the following Dayhoff sequences: B30305, 
GEN13490, 153641, S53363, HA34 BRELC, SP96_DICDI, S36326, SSU51197J0, MUC1_XENLA, 
TCU32448J and AF000409J. 

35 EXAMPLE 127: Isolation of cDNA clones Encoding Human PROl 187 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
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expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, C A) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
5 the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57726. 

In light of an observed sequence homology between the DNA57726 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 358563, the Incyte EST clone 358563 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

10 The sequence of this cDNA insert is shown in Figure 283 and is herein designated as DNA62876-1517. 

The full length clone shown in Figure 283 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 121-123 and ending at the stop codon found at nucleotide 
positions 481-483 (Figure 283; SEQ ID NO:398). The predicted polypeptide precursor (Figure 284, SEQ ID 
NO:399) is 120 amino acids long. The signal peptide is at about amino acids 1-17 of SEQ ID NO:399. 

15 PR01187 has a calculated molecular weight of approximately 12,925 daltons and an estimated pi of 
approximately 9.46. Clone DNA62876-1517 was deposited with the ATCC on August 4, 1998 and is assigned 
ATCC deposit no. 203095. It is understood that the deposited clone contains the actual sequence and that the 
representations herein may have minor sequencing errors. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

20 alignment analysis of the full-length sequence shown in Figure 284 (SEQ ID NO:399), revealed some sequence 
identity (and therefore some relation) between the PRO 11 87 amino acid sequence and the following Dayhoff 
sequences: MGNENDOBXJ, CELF41G3_9, AMPG STRLI, HSBBOVHERL2, LEEXTEN101, 
AF029958J and P W04957. 

25 EXAMPLE 128 : Isolation of cDNA clones Encoding Human PRQ1185 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identity existing homologies. The 

30 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56426. 

35 In light of an observed sequence homology between the DNA56426 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 328441 1, the Incyte EST clone 328441 1 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a rull-length protein. 
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The sequence of this cDNA insert is shown in Figure 285 and is herein designated as DNA62881-1515. 

The full length DNA6288 1-1515 clone shown in Figure 285 contained a single open reading frame with 
an apparent translational initiation site at nucleotide positions 4-6 and ending at the stop codon found at 
nucleotide positions 598-600 (Figure 285; SEQ ID NO:4O0). The predicted polypeptide precursor (Figure 286, 
SEQ ID NO:401) is 198 amino acids long. The signal peptide is at about amino acids 1-21 of SEQ ID NO:401 . 
5 PR01185 has a calculated molecular weight of approximately 22,105 daltons and an estimated pi of 
approximately 7.73. Clone DNA62881-1515 has been deposited with the ATCC and is assigned ATCC deposit 
no. 203096. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 286 (SEQ ID NO:401), revealed some sequence 
10 identity between the PROH85 amino acid sequence and the following Dayhoff sequences: TUP1_YEAST, 
AF041382J, MAOM_SOLTU, SPPBPHU9JJ41024, EPCPLCFAILJ , HSPLECJ, YKL4_CAEEL, 
A44643,TGU65922J. 

EXAMPLE 129 : Isolation of cDNA clones Encodine Human PRO 1345 
15 A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 

in Example 1 above. This consensus sequence is herein designated DNA47364. Based on the DNA47364 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PR01345. 

20 PCR primers (forward and reverse) were synthesized: 

forward PCR primer S'-CCTGGTTATCCCCAGGAACTCCGAC-S* (SEQ ID NO:404) 

reverse PCR primer 5'-CTCTTGCTGCTGCGACAGGCCTC-3 1 (SEQ ID NO:405) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA47364 

sequence which had the following nucleotide sequence 

25 hybridization probe 

5'-CGCCCTCCAAGACTATGGTAAAAGGAGCCTGCCAGGTGTCAATGAC-3' (SEQ ID NO:406) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO 1345 gene using the probe oligonucleotide and one of the PCR primers. RNA 

30 for construction of the cDNA libraries was isolated from human breast carcinoma tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR01345 (designated herein as DNA64852-1589 [Figure 287, SEQ ID NO:402J) and the derived protein 
sequence for PR01345. 

The entire nucleotide sequence of DNA64852-1589 is shown in Figure 287 (SEQ ID NO:402). Clone 
35 DNA64852-1589 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 7-9 or 34-36 and ending at the stop codon at nucleotide positions 625-627 (Figure 287). The predicted 
polypeptide precursor is 206 amino acids long (Figure 288). The full-length PR01345 protein shown in Figure 
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288 has an estimated molecular weight of about 23,190 daltons and a pi of about 9.40. Analysis of the full- 
length PR01345 sequence shown in Figure 288 (SEQ ID NO:403) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 3 1 or from about amino acid 10 to about amino acid 
31 and a C-type lectin domain signature sequence from about amino acid 176 to about amino acid 190. Clone 
DNA64852-1589 has been deposited with ATCC on August 18, 1998 and is assigned ATCC deposit no. 203127. 
5 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 288 (SEQ ID NO:403), evidenced significant 
homology between the PRO 1345 amino acid sequence and the following Dayhoff sequences: BTU22298J, 
TETNCARSP, TETNHUMAN, MABA_RAT, S34198, P W13144, MACMBPAl, A46274, PSPD^RAT 
ANDPJ132188. 

10 

EXAMPLE 130 : Isolation of cDNA clones Encoding Human PRO 1245 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 

15 EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 

20 obtained therefrom is herein designated DNA56019. 

In light of an observed sequence homology between the DNA56019 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 1327836, the Incyte EST clone 1327836 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 289 and is herein designated as DNA64884-1527. 

25 The full length clone shown in Figure 289 contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 79-81 and ending at the stop codon found at nucleotide positions 
391-393 (Figure 289; SEQ ID NO:407). The predicted polypeptide precursor (Figure 290, SEQ ID NO:408) 
is 104 amino acids long, with a signal peptide sequence at about amino acid 1 to about amino acid 18. 
PRO 1245 has a calculated molecular weight of approximately 10,100 daltons and an estimated pi of 

30 approximately 8.76. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 290 (SEQ ID NO:408), revealed some homology 
between the PR01245 amino acid sequence and the following Dayhoff sequences: SYA THETH, GEN1 1 167, 
MTV044_4, AB01 1151 J, RLAJ2750_3, SNELIPTRA l, S63624, C28391, A37907, and S14064. 

35 Clone DNA64884-1245 was deposited with the ATCC on August 25, 1998 and is assigned ATCC 

deposit no. 203155. 
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EXAMPLE 131: Isolation of cDNA clones Encoding Human PRO 1358 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ® Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
5 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzymojogy. 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). 

In light of an observed sequence homology between the consensus sequence and an EST sequence 

10 encompassed within the Incyte EST clone no. 88718, the Incyte EST clone 88718 was purchased and the cDNA 
insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The sequence 
of this cDNA insert is shown in Figure 291 and is herein designated as DNA64890-1612. 

The full length clone shown in Figure 291 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 86 through 88 and ending at the stop codon found at nucleotide 

15 positions 1418 through 1420 (Figure 291; SEQ ID NO:409). The predicted polypeptide precursor (Figure 292, 
SEQ ID NO:410) is 444 amino acids long. The signal peptide is at about amino acids 1-18 of SEQ ID NO:410. 
PRO 1358 has a calculated molecular weight of approximately 50,719 daltons and an estimated pi of 
approximately 8.82. Clone DNA64890-1612 was deposited with the ATCC on August 18, 1998 and is assigned 
ATCC deposit no. 203131. 

20 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 292 (SEQ ID NO:4 10), revealed sequence identity 
between the PR01358 amino acid sequence and the following Dayhoff sequences: P W07607, AB000545_1, 
AB000546J , Al ATRAT, ABO 151 64 J, PJ>50021, COTR_CAVPO, and HAMHPPJ . The variants claimed 
in this application exclude these sequences. 

25 

EXAMPLE 132 : Isolation of cDNA clones Encoding Human PROl 195 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 

30 EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 

35 obtained therefrom is herein designated DNA55716. 

In light of an observed sequence homology between the DNA55716 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3252980, the Incyte EST clone 3252980 was purchased 
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and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 293 and is herein designated as DNA65412-1523. 

The full length clone shown in Figure 293 contained a single open reading frame with an apparent 
transiational initiation site at nucleotide positions 58-60 and ending at the stop codon found at nucleotide positions 
511-513 (Figure 293; SEQ ID NO:411). The predicted polypeptide precursor (Figure 294, SEQ ID NO:412) 
5 is 151 amino acids long. The signal sequence is at about amino acids 1-22 of SEQ ID NO:412. PROl 195 has 
a calculated molecular weight of approximately 17,277 daltons and an estimated pi of approximately 5.33. Clone 
DNA65412-1523 was deposited with the ATCC on August 4, 1998 and is assigned ATCC deposit no. 203094. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 294 (SEQ ID NO:412), revealed some sequence 
10 identity between the PROl 195 amino acid sequence and the following Dayhoff sequences: MMU28486_1, 
AF044205J, PJV31186, CELK03C7J, F69034, EF1AMETVA, AF024540_1, SSU90353J, 
MRSP_STAAU and P_R97680. 

EXAMPLE 133 : Isolation of cDNA clones Encoding Human PRO1270 

15 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 

20 Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap w (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57951. 

In light of an observed sequence homology between the DNA57951 consensus sequence and an EST 

25 sequence encompassed within the Merck EST clone no. 124878, the Merck EST clone 124878 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 295 and is herein designated as DNA66308-1537. 

Clone DNA66308-1537 contains a single open reading frame with an apparent transiational initiation 
site at nucleotide positions 103-105 and ending at the stop codon at nucleotide positions 1042-1044 (Figure 295). 

30 The predicted polypeptide precursor is 313 amino acids long (Figure 296). The full-length PRO 1270 protein 
shown in Figure 296 has an estimated molecular weight of about 34,978 daltons and a pi of about 5.71 . Analysis 
of the full-length PRO1270 sequence shown in Figure 296 (SEQ ID NO:414) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 16, a potential N-glycosylation site from 
about amino acid 163 to about amino acid 166 and glycosaminoglycan attachment sites from about amino acid 

35 74 to about amino acid 77 and from about amino acid 289 to about amino acid 292. Clone DNA66308-1537 has 
been deposited with ATCC on August 25, 1998 and is assigned ATCC deposit no. 203159. 
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An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 296 (SEQ ID NO:414), evidenced significant 
homology between the PRO1270 amino acid sequence and the following Dayhoff sequences: XLU86699 1, 
S49589, FIBA_PARPA,FIBB_HUMAN,P_R47189, AF004326_1,DRTENASCN_1, AF004327J, P_W0141 1 
and FIBG_BOVIN. 

5 

EXAMPLE 134: Isolation of cDNA clones Encoding Human PRQ1271 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 

10 EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program u phrap n (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 

15 obtained therefrom is herein designated DNA57955. 

In light of an observed sequence homology between the DNA57955 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. AA625350, the Merck EST clone AA625350 was 
purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length 
protein. The sequence of this cDNA insert is shown in Figure 297 and is herein designated as DNA66309-1538. 

20 Clone DNA66309-1538 contains a single open reading frame with an apparent translational initiation 

site at nucleotide positions 94-96 and ending at the stop codon at nucleotide positions 7 18-720 (Figure 297). The 
predicted polypeptide precursor is 208 amino acids long (Figure 298). The full-length PR01271 protein shown 
in Figure 298 has an estimated molecular weight of about 21,531 daltons and a pi of about 8.99. Analysis of 
the full-length PR01271 sequence shown in Figure 298 (SEQ ID NO:416) evidences the presence of the 

25 following: a signal peptide from about amino acid 1 to about amino acid 3 1 and a transmembrane domain from 
about amino acid 166 to about amino acid 187. Clone DNA66309-1538 has been deposited with ATCC on 
September 15, 1998 and is assigned ATCC deposit no. 203235. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 298 (SEQ ID NO:416), evidenced significant 

30 homology between the PR01271 amino acid sequence and the following Dayhoff sequences: S57180, S63257, 
AGA1YEAST, BPU43599J, YS8ACAEEL, S67570, LSU54556_2, S7O305, VGLX_HSVEB, and 
D88733J. 

EXAMPLE 135: Isolation of cDNA clones Encoding Human PRQ1375 
35 A Merck/Wash. U. database was searched and a Merck EST was identified. This sequence was then 

put in a program which aligns it with other seequences from the Swiss-Prot public database, public EST 
databases (e.g., GenBank, Merck/Wash. U.), and a proprietary EST database (LIFESEQ®, Incyte 
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Pharmaceuticals, Palo Alto, CA). The search was performed using the computer program BLAST or BLAST2 
[Altschul et al M Methods in Enzvmology. 266:460-480 (1996)] as a comparison of the extracellular domain 
(ECD) protein sequences to a 6 frame translation of the EST sequences. Those comparisons resulting in a 
BLAST score of 70 (or in some cases, 90) or greater that did not encode known proteins were clustered and 
assembled into consensus DNA sequences with the program "phrap" (Phil Green, University of Washington, 
5 Seattle, Washington). 

A consensus DNA sequence was assembled relative to other EST sequences using phrap. This 
consensus sequence is designated herein "DNA67003". 

Based on theDNA67003 consensus sequence, the nucleic acid (SEQ ID NO:417) was identified in a 
human pancreas library. DNA sequencing of the clone gave the full-length DNA sequence for PR01375 and 
10 the derived protein sequence for PRO 1 375 . 

The entire coding sequence of PR01375 is shown in Figure 299 (SEQ ID NO:4 17). Clone DNA67004- 
1614 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 
104-106 and an apparent stop codon at nucleotide positions 698-700 of SEQ ID NO:417. The predicted 
polypeptide precursor is 198 amino acids long. The transmembrane domains are at about amino acids 1 1-28 
15 (type II) and 103-125 of SEQ ID NO:418. Clone DNA67004-1614 has been deposited with ATCC and is 
assigned ATCC deposit no. 203115. The full-length PR01375 protein shown in Figure 300 has an estimated 
molecular weight of about 22,531 daltons and a pi of about 8.47. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 300 (SEQ ID NO:4 1 8), revealed sequence identity 
20 between the PR01375 amino acid sequence and the following Dayhoff sequences: AF026198_5, CELR12C125, 
S73465, Y011_MYCPN, S64538J, P_P8150, MUVSHPO10J, VSH MUMPL and CVU59751_5. 

EXAMPLE 136: Isolation of cDNA clones Encoding Human PRO 1385 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

25 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 

30 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57952. 

In light of an observed sequence homology between the DNA57952 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3129630, the Incyte EST clone 3129630 was purchased 

35 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 301 and is herein designated as DNA68869-1610. 
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Clone DNA68869-1610 contains a single open reading frame with an apparent translation^ initiation 
site at nucleotide positions 26-28 and ending at the stop codon at nucleotide positions 410-412 (Figure 301). The 
predicted polypeptide precursor is 128 amino acids long (Figure 302). The full-length PR01385 protein shown 
in Figure 302 has an estimated molecular weight of about 13,663 daltons and a pi of about 10.97. Analysis of 
the full-length PR01385 sequence shown in Figure 302 (SEQ ID NO:420) evidences the presence of the 
5 following: a signal peptide from about amino acid 1 to about amino acid 28, and glycosylaminoglycan attachment 
sites from about amino acid 82 to about amino acid 85 and from about amino acid 91 to about amino acid 94. 
Clone DNA68869-1610 has been deposited with ATCC on August 25, 1998 and is assigned ATCC deposit no. 
203164. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
10 alignment analysis of the full-length sequence shown in Figure 302 (SEQ ID NO:420), evidenced low homology 
between the PR01385 amino acid sequence and the following Dayhoff sequences: CELT14A8_1, 
LMNACHRAll, HXD9_HUMAN, CHKCMLFl, HS5PP34 2, DMDRINGJ, A37107J, 
MMLUNGENEJ, PUM_DROME and DMU25117J. 

15 EXAMPLE 137: Isolation of cDNA clones Encoding Human PRO 1387 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 

20 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56259. 

25 In light of an observed sequence homology between the DNA56259 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3507924, the Incyte EST clone 3507924 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 303 and is herein designated as DNA68872-1620. 

Clone DNA68872-1620 contains a single open reading frame with an apparent translational initiation 

30 site at nucleotide positions 85-87 and ending at the stop codon at nucleotide positions 1267-1269 (Figure 303). 
The predicted polypeptide precursor is 394 amino acids long (Figure 304). The full-length PR01387 protein 
shown in Figure 304 has an estimated molecular weight of about 44,339 daltons and a pi of about 7.10. Analysis 
of the full-length PR01387 sequence shown in Figure 304 (SEQ ID NO:422) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 19, a transmembrane domain from about 

35 amino acid 275 to about amino acid 296, potential N-glycosylation sites from about amino acid 76 to about amino 
acid 79, from about amino acid 231 to about amino acid 234, from about amino acid 302 to about amino acid 
305, from about amino acid 307 to about amino acid 310 and from about amino acid 376 to about amino acid 
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379, and amino acid sequence blocks having homology to myelin pO protein from about amino acid 210 to about 
amino acid 239 and from about amino acid 92 to about amino acid 121. Clone DNA68872-1620 has been 
deposited with ATCC on August 25, 1998 and is assigned ATCC deposit no. 203160. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 304 (SEQ ID NO:422), evidenced significant 
5 homology between the PR01387 amino acid sequence and the following Dayhoff sequences: P_W36955, 
MYP0_HETFR, HS46KDAJ, AF049498J, MYO0HUMAN, AF030454J, A53268, SHPTCRAJ, 
P_W14146andGEN12838. 

EXAMPLE 138: Isolation of cD NA clones Encoding Human PRO 1384 
10 A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 

in Example 1 above. This consensus sequence is herein designated DNA54192. Based on the DNA54192 

sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence 

of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO 1384. 
PCR primers (forward and reverse) were synthesized: 
15 forward PCR primer 5'-TGCAGCCCCTGTGACACAAACTGG-3* (SEQ ID NO:425) 

reverse PCR primer 5 ' -CTGAGATAACCGAGCC ATCCTCCCAC-3 ' (SEQ ID NO:426) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA54192 

sequence which had the following nucleotide sequence: 

hybridization probe 

20 5 , -GGAGATAGCTGCTATGGGTTCTTCAGGCACAACTTAACATGGGAAG-3 , (SEQ ID NO:427) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 

isolate clones encoding the PR01384 gene using the probe oligonucleotide and one of the PCR primers. RNA 

for construction of the cDNA libraries was isolated from human fetal liver. 
25 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PRO 1384 (designated herein as DNA7 1159-1617 [Figure 305, SEQ ID NO:423]; and the derived protein 

sequence for PRO 1384. 

The entire coding sequence of PR01384 is shown in Figure 305 (SEQ ID NO:423). Clone DNA71 159- 
1617 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 

30 1 82- 1 84 and an apparent stop codon at nucleotide positions 869-87 1 . The predicted polypeptide precursor is 229 
amino acids long. The full-length PRO 1384 protein shown in Figure 306 has an estimated molecular weight of 
about 26,650 daltons and a pi of about 8.76. Additional features include a type II transmembrane domain at 
about amino acids 32-57, and potential N-glycosylation sites at about amino acids 68-71, 120-123, and 134-137. 
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

35 alignment analysis of the full-length sequence shown in Figure 306 (SEQ ID NO:424), revealed homology 
between the PR01384 amino acid sequence and the following Dayhoff sequences: AF054819_1 , HSAJ 1687_1 , 
AF00951 1 J, AB010710J,GEN13595, HSAJ673J,G^ 
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and NK13_RAT. 

Clone DNA71 159-1617 has been deposited with ATCC and is assigned ATCC deposit no. 203135. 

EXAMPLE 139: Use of PRO as a hybridization probe 

The following method describes use of a nucleotide sequence encoding PRO as a hybridization probe. 

DNA comprising the coding sequence of full-length or mature PRO as disclosed herein is employed as 
a probe to screen for homologous DNAs (such as those encoding naturally-occurring variants of PRO) in human 
tissue cDNA libraries or human tissue genomic libraries. 

Hybridization and washing of filters containing either library DNAs is performed under the following 
high stringency conditions. Hybridization of radiolabeled PRO-derived probe to the filters is performed in a 
solution of 50% formamide, 5x SSC, 0. 1 % SDS, 0. 1 % sodium pyrophosphate, 50 raM sodium phosphate, pH 
6.8, 2x Denhardfs solution, and 10% dextran sulfate at 42°C for 20 hours. Washing of the filters is performed 
in an aqueous solution of O.lx SSC and 0.1% SDS at 42°C. 

DNAs having a desired sequence identity with the DNA encoding full-length native sequence PRO can 
then be identified using standard techniques known in the art. 

EXAMPLE 140 : Expression of PRO in E. coli 

This example illustrates preparation of an unglycosylated form of PRO by recombinant expression in 

E. coli. 

The DNA sequence encoding PRO is initially amplified using selected PCR primers. The primers 
should contain restriction enzyme sites which correspond to the restriction enzyme sites on the selected 
expression vector. A variety of expression vectors may be employed. An example of a suitable vector is 
pBR322 (derived from E. coli; see Bolivar et ah, Gene. 2:95 (1977)) which contains genes for ampicillin and 
tetracycline resistance. The vector is digested with restriction enzyme and dephosphorylated. The PCR 
amplified sequences are then ligated into the vector. The vector will preferably include sequences which encode 
for an antibiotic resistance gene, a trp promoter, a polyhis leader (including the first six STII codons, polyhis 
sequence, and enterokinase cleavage site), the PRO coding region, lambda transcriptional terminator, and an 
argU gene. 

The ligation mixture is then used to transform a selected £. coli strain using the methods described in 
Sambrook et al. , supra . Transformants are identified by their ability to grow on LB plates and antibiotic resistant 
colonies are then selected. Plasmid DNA can be isolated and confirmed by restriction analysis and DNA 
sequencing. 

Selected clones can be grown overnight in liquid culture medium such as LB broth supplemented with 
antibiotics. The overnight culture may subsequently be used to inoculate a larger scale culture. The cells are 
then grown to a desired optical density, during which the expression promoter is turned on. 

After culturing the cells for several more hours, the cells can be harvested by centrifugation. The cell 
pellet obtained by the centrifugation can be solubilized using various agents known in the art, and the solubilized 
PRO protein can then be purified using a metal chelating column under conditions that allow tight binding of the 
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protein. 

PRO may be expressed in E. coli in a poly-His tagged form, using the following procedure. The DNA 
encoding PRO is initially amplified using selected PCR primers. The primers will contain restriction enzyme 
sites which correspond to the restriction enzyme sites on the selected expression vector, and other useful 
sequences providing for efficient and reliable translation initiation, rapid purification on a metal chelation 
5 column, and proteolytic removal with enterokinase. The PCR-amplified, poly-His tagged sequences are then 
ligated into an expression vector, which is used to transform an E. coli host based on strain 52 (W3110 
fuhA(tonA) Ion galE rpoHts(htpRts) clpP(Iadq). Transformants are first grown in LB containing 50 mg/ml 
carbenicillin at 30°C with shaking until an O.D.600 of 3-5 is reached. Cultures are then diluted 50-100 fold into 
CRAP media (prepared by mixing 3.57 g (NH 4 ) 2 S0 4 , 0.71 g sodium citrate*2H20, 1.07 g KC1, 5.36 g Difco 

10 yeast extract, 5.36 g Sheffield hycase SF in 500 mL water, as well as 1 10 mM MPOS, pH 7.3, 0.55% (w/v) 
glucose and 7 mM MgS0 4 ) and grown for approximately 20-30 hours at 30°C with shaking. Samples are 
removed to verify expression by SDS-PAGE analysis, and the bulk culture is centrifuged to pellet the cells. Cell 
pellets are frozen until purification and refolding. 

£. coli paste from 0.5 to 1 L fermentations (6-10 g pellets) is resuspended in 10 volumes (w/v) in 7 M 

15 guanidine, 20 mM Tris, pH 8 buffer. Solid sodium sulfite and sodium tetrathionate is added to make final 
concentrations of 0.1M and 0.02 M, respectively, and the solution is stirred overnight at 4 °C. This step results 
in a denatured protein with all cysteine residues blocked by sulfuolization. The solution is centrifuged at 40,000 
rpm in a Beckman Ultracentifuge for 30 min. The supernatant is diluted with 3-5 volumes of metal chelate 
column buffer (6 M guanidine, 20 mM Tris, pH 7.4) and filtered through 0.22 micron filters to clarify. The 

20 clarified extract is loaded onto a 5 ml Qiagen Ni-NTA metal chelate column equilibrated in the metal chelate 
column buffer. The column is washed with additional buffer containing 50 mM imidazole (Calbiochem, Utrol 
grade), pH 7.4. The protein is eluted with buffer containing 250 mM imidazole. Fractions containing the 
desired protein are pooled and stored at 4°C Protein concentration is estimated by its absorbance at 280 nm 
using the calculated extinction coefficient based on its amino acid sequence. 

25 The proteins are refolded by diluting the sample slowly into freshly prepared refolding buffer consisting 

of: 20 mM Tris, pH 8.6, 0.3 M NaCl, 2.5 M urea, 5 mM cysteine, 20 mM glycine and 1 mM EDTA. 
Refolding volumes are chosen so that the final protein concentration is between 50 to 100 micrograms/ml. The 
refolding solution is stirred gently at 4°C for 12-36 hours. The refolding reaction is quenched by the addition 
of TFA to a final concentration of 0.4% (pH of approximately 3). Before further purification of the protein, the 

30 solution is filtered through a 0.22 micron filter and acetonitrile is added to 2-10% final concentration. The 
refolded protein is chromatographed on a Poros Rl/H reversed phase column using a mobile buffer of 0. 1 % 
TFA with elution with a gradient of acetonitrile from 10 to 80%. Aliquots of fractions with A280 absorbance 
are analyzed on SDS poly aery lamide gels and fractions containing homogeneous refolded protein are pooled. 
Generally, the properly refolded species of most proteins are eluted at the lowest concentrations of acetonitrile 

35 since those species are the most compact with their hydrophobic interiors shielded from interaction with the 
reversed phase resin. Aggregated species are usually eluted at higher acetonitrile concentrations. In addition 
to resolving misfolded forms of proteins from the desired form, the reversed phase step also removes endotoxin 
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from the samples. 

Fractions containing the desired folded PRO polypeptide are pooled and the acetonitrile removed using 
a gentle stream of nitrogen directed at the solution. Proteins are formulated into 20 mM Hepes, pH 6.8 with 
0. 14 M sodium chloride and 4 % mannitol by dialysis or by gel filtration using G25 Superfine (Pharmacia) resins 
equilibrated in the formulation buffer and sterile filtered. 
5 Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 

EXAMPLE 141 : Expression of PRO in mammalian cells 

This example illustrates preparation of a potentially glycosylated form of PRO by recombinant 
expression in mammalian cells. 
10 The vector, pRK5 (see EP 307,247, published March 15, 1989), is employed as the expression vector. 

Optionally, the PRO DNA is ligated into pRK5 with selected restriction enzymes to allow insertion of the PRO 
DNA using ligation methods such as described in Sambrook et al. , supra . The resulting vector is called pRK5- 
PRO. 

In one embodiment, the selected host cells may be 293 cells. Human 293 cells (ATCC CCL 1573) are 

15 grown to confluence in tissue culture plates in medium such as DM EM supplemented with fetal calf serum and 
optionally, nutrient components and/or antibiotics. About 10 fig pRK5-PRO DNA is mixed with about 1 ng 
DNA encoding the VA RNA gene [Thimmappaya et al., Cell, XL:543 (1982)] and dissolved in 500 /il of 1 mM 
Tris-HCl, 0.1 mM EDTA, 0.227 M CaCl 2 . To this mixture is added, dropwise, 500 id of 50 mM HEPES (pH 
7.35), 280 mM NaCl, 1.5 mM NaP0 4 , and a precipitate is allowed to form for 10 minutes at 25°C. The 

20 precipitate is suspended and added to the 293 cells and allowed to settle for about four hours at 37°C. The 
culture medium is aspirated off and 2 ml of 20% glycerol in PBS is added for 30 seconds. The 293 cells are 
then washed with serum free medium, fresh medium is added and the cells are incubated for about 5 days. 

Approximately 24 hours after the transfections, the culture medium is removed and replaced with culture 
medium (alone) or culture medium containing 200 pCi/ml 35 S-cysteine and 200 /iCi/ml ^methionine. After 

25 a 12 hour incubation, the conditioned medium is collected, concentrated on a spin filter, and loaded onto a 15 % 
SDS gel. The processed gel may be dried and exposed to film for a selected period of time to reveal the 
presence of PRO polypeptide. The cultures containing transfected cells may undergo further incubation (in 
serum free medium) and the medium is tested in selected bioassays. 

In an alternative technique, PRO may be introduced into 293 cells transiently using the dextran sulfate 

30 method described by Somparyrac et al., Proc. Natl. Acad. Sci.. J2:7575 (1981). 293 cells are grown to 
maximal density in a spinner flask and 700 /xg pRK5-PRO DNA is added. The cells are first concentrated from 
the spinner flask by centrifiigation and washed with PBS. The DNA-dextran precipitate is incubated on the cell 
pellet for four hours. The cells are treated with 20% glycerol for 90 seconds, washed with tissue culture 
medium, and re-introduced into the spinner flask containing tissue culture medium, 5 /ig/ml bovine insulin and 

35 0. 1 itg/ml bovine transferrin. After about four days, the conditioned media is centrifuged and filtered to remove 
cells and debris. The sample containing expressed PRO can then be concentrated and purified by any selected 
method, such as dialysis and/or column chromatography. 
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In another embodiment, PRO can be expressed in CHO cells. The pRK5-PRO can be transfected into 
CHO cells using known reagents such as CaP0 4 or DEAE-dextran. As described above, the cell cultures can 
be incubated, and the medium replaced with culture medium (alone) or medium containing a radiolabel such as 
35 S-methionine. After deterniining the presence of PRO polypeptide, the culture medium may be replaced with 
serum free medium. Preferably, the cultures are incubated for about 6 days, and then the conditioned medium 
5 is harvested. The medium containing the expressed PRO can then be concentrated and purified by any selected 
method. 

Epitope-tagged PRO may also be expressed in host CHO cells. The PRO may be subcloned out of the 
pRK5 vector. The subclone insert can undergo PCR to fuse in frame with a selected epitope tag such as a poly- 
his tag into a Baculovirus expression vector. The poly-his tagged PRO insert can then be subcloned into a SV40 

10 driven vector containing a selection marker such as DHFR for selection of stable clones. Finally, the CHO cells 
can be transfected (as described above) with the SV40 driven vector. Labeling may be performed, as described 
above, to verify expression. The culture medium containing the expressed poly-His tagged PRO can then be 
concentrated and purified by any selected method, such as by Ni 2+ -chelate affinity chromatography. 

PRO may also be expressed in CHO and/or COS cells by a transient expression procedure or in CHO 

15 cells by another stable expression procedure. 

Stable expression in CHO cells is performed using the following procedure. The proteins are expressed 
as an IgG construct (immunoadhesin), in which the coding sequences for the soluble forms (e.g. extracellular 
domains) of the respective proteins are fused to an IgGl constant region sequence containing the hinge, CH2 and 
CH2 domains and/or is a poly-His tagged form. 

20 Following PCR amplification, the respective DNAs are subcloned in a CHO expression vector using 

standard techniques as described in Ausubel et al. t Current Protocols of Molecular Biology. Unit 3. 16, John 
Wiley and Sons (1997). CHO expression vectors are constructed to have compatible restriction sites 5' and 3' 
of the DNA of interest to allow the convenient shuttling of cDNA's. The vector used expression in CHO cells 
is as described in Lucas et al., Nucl. Acids Res. 24:9 (1774-1779 (1996), and uses the SV40 early 

25 promoter/enhancer to drive expression of the cDNA of interest and dihydrofolate reductase (DHFR). DHFR 
expression permits selection for stable maintenance of the plasmid following transfection. 

Twelve micrograms of the desired plasmid DNA is introduced into approximately 10 million CHO cells 
using commercially available transfection reagents Superfect" (Quiagen), Dosper* or Fugene* (Boehringer 
Mannheim). The cells are grown as described in Lucas et al., supra . Approximately 3 x 10" 7 cells are frozen 

30 in an ampule for further growth and production as described below. 

The ampules containing the plasmid DNA are thawed by placement into water bath and mixed by 
vortexing. The contents are pipetted into a centrifuge tube containing 10 mLs of media and centrifuged at 1000 
rpm for 5 minutes. The supernatant is aspirated and the cells are resuspended in 10 mL of selective media (0.2 
tim filtered PS20 with 5% 0.2 diafiltered fetal bovine serum). The cells are then aliquoted into a 100 mL 

35 spinner containing 90 mL of selective media. After 1-2 days, the cells are transferred into a 250 mL spinner 
filled with 150 mL selective growth medium and incubated at 37?C. After another 2-3 days, 250 mL, 500 mL 
and 2000 mL spinners are seeded with 3 x 10 5 cells/mL. The cell media is exchanged with fresh media by 
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centrifugation and resuspension in production medium. Although any suitable CHO media may be employed, 
a production medium described in U.S. Patent No. 5,122,469, issued June 16, 1992 may actually be used. A 
3L production spinner is seeded at 1.2 x 10 6 cells/mL. On day 0, the cell number pH ie determined. On day 
1, the spinner is sampled and sparging with filtered air is commenced. On day 2, the spinner is sampled, the 
temperature shifted to 33°C, and 30 mL of 500 g/L glucose and 0.6 mL of 10% antifoam (e.g., 35% 
5 polydimethylsiloxane emulsion, Dow Corning 365 Medical Grade Emulsion) taken. Throughout the production, 
the pH is adjusted as necessary to keep it at around 7.2. After 10 days, or until the viability dropped below 
70%, the cell culture is harvested by centrifugation and filtering through a 0.22 /^m filter. The filtrate was either 
stored at 4°C or immediately loaded onto columns for purification. 

For the poly-His tagged constructs, the proteins are purified using a Ni-NTA column (Qiagen). Before 

10 purification, imidazole is added to the conditioned media to a concentration of 5 mM. The conditioned media 
is pumped onto a 6 ml Ni-NTA column equilibrated in 20 mM Hepes, pH 7.4, buffer containing 0.3 M NaCl 
and 5 mM imidazole at a flow rate of 4-5 ml/min. at 4°C. After loading, the column is washed with additional 
equilibration buffer and the protein eluted with equilibration buffer containing 0.25 M imidazole. The highly 
purified protein is subsequently desalted into a storage buffer containing 10 mM Hepes, 0. 14 M NaCl and 4% 

15 mannitol, pH 6.8, with a 25 ml G25 Superfine (Pharmacia) column and stored at -80°C. 

Immunoadhesin (Fc-containing) constructs are purified from the conditioned media as follows. The 
conditioned medium is pumped onto a 5 ml Protein A column (Pharmacia) which had been equilibrated in 20 
mM Na phosphate buffer, pH 6.8. After loading, the column is washed extensively with equilibration buffer 
before elution with 100 mM citric acid, pH 3.5. The eluted protein is immediately neutralized by collecting 1 

20 ml fractions into tubes containing 275 a<L of 1 M Tris buffer, pH 9. The highly purified protein is subsequently 
desalted into storage buffer as described above for the poly-His tagged proteins. The homogeneity is assessed 
by SDS polyacrylamide gels and by N-terminal amino acid sequencing by Edraan degradation. 

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 

25 EXAMPLE 142: Expression of PRO in Yeast 

The following method describes recombinant expression of PRO in yeast. 

First, yeast expression vectors are constructed for intracellular production or secretion of PRO from 
the ADH2/GAPDH promoter. DNA encoding PRO and the promoter is inserted into suitable restriction enzyme 
sites in the selected plasmid to direct intracellular expression of PRO. For secretion, DNA encoding PRO can 

30 be cloned into the selected plasmid, together with DNA encoding the ADH2/GAPDH promoter, a native PRO 
signal peptide or other mammalian signal peptide, or, for example, a yeast alpha-factor or invertase secretory 
signal/leader sequence, and linker sequences (if needed) for expression of PRO. 

Yeast cells, such as yeast strain AB 1 10, can then be transformed with the expression plasmids described 
above and cultured in selected fermentation media. The transformed yeast supernatants can be analyzed by 

35 precipitation with 10% trichloroacetic acid and separation by SDS-PAGE, followed by staining of the gels with 
Coomassie Blue stain. 
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Recombinant PRO can subsequently be isolated and purified by removing the yeast cells from the 
fermentation medium by centrifugation and then concentrating the medium using selected cartridge filters. The 
concentrate containing PRO may further be purified using selected column chromatography resins. 

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 



5 EXAMPLE 143: Expression of PRO in Baculovirus-Infected Insect Cells 

The following method describes recombinant expression of PRO in Baculovirus-infected insect cells. 
The sequence coding for PRO is fused upstream of an epitope tag contained within a baculovirus 
expression vector. Such epitope tags include poly-his tags and immunoglobulin tags (like Fc regions of IgG). 
A variety of plasmids may be employed, including plasmids derived from commercially available plasmids such 

10 as pVL1393 (Novagen). Briefly, the sequence encoding PRO or the desired portion of the coding sequence of 
PRO such as the sequence encoding the extracellular domain of a transmembrane protein or the sequence 
encoding the mature protein if the protein is extracellular is amplified by PCR with primers complementary to 
the 5* and 3' regions. The 5* primer may incorporate flanking (selected) restriction enzyme sites. The product 
is then digested with those selected restriction enzymes and subcloned into the expression vector. 

15 Recombinant baculovirus is generated by co-transfecting the above plasmid and BaculoGold™ virus 

DNA (Pharmingen) into Spodopterafrugiperda ("Sf?") cells (ATCC CRL 171 1) using lipofectin (commercially 
available from GIBCO-BRL). After 4-5 days of incubation at 28°C, the released viruses are harvested and used 
for further amplifications. Viral infection and protein expression are performed as described by O'Reilley et 
al., Baculovirus expression vectors: A Laboratory Manual. Oxford: Oxford University Press (1994). 

20 Expressed poly-his tagged PRO can then be purified, for example, by Ni 2+ -chelate affinity 

chromatography as follows. Extracts are prepared from recombinant virus- infected Sf9 cells as described by 
Rupert et ah, Nature . 362: 175-179 (1993). Briefly, Sf9 cells are washed, resuspended in sonication buffer (25 
mL Hepes, pH 7.9; 12.5 mM MgCl 2 ; 0.1 mM EDTA; 10% glycerol; 0. 1 % NP-40; 0.4 M KC1), and sonicated 
twice for 20 seconds on ice. The sonicates are cleared by centrifugation, and the supernatant is diluted 50-fold 

25 in loading buffer (50 mM phosphate, 300 mM NaCl, 10% glycerol, pH 7.8) and filtered through a 0.45 //m 
filter. A Ni 2+ -NTA agarose column (commercially available from Qiagen) is prepared with a bed volume of 5 
mL, washed with 25 mL of water and equilibrated with 25 mL of loading buffer. The filtered cell extract is 
loaded onto the column at 0.5 mL per minute. The column is washed to baseline A 280 with loading buffer, at 
which point fraction collection is started. Next, the column is washed with a secondary wash buffer (50 mM 

30 phosphate; 300 mM NaCl, 10% glycerol, pH 6.0), which elutes nonspecifically bound protein. After reaching 
A 28 o baseline again, the column is developed with a 0 to 500 mM Imidazole gradient in the secondary wash 
buffer. One mL fractions are collected and analyzed by SDS-PAGE and silver staining or Western blot with 
Ni 2+ -NTA-conjugated to alkaline phosphatase (Qiagen). Fractions containing the eluted His l0 -tagged PRO are 
pooled and dialyzed against loading buffer. 

35 Alternatively, purification of the IgG tagged (or Fc tagged) PRO can be performed using known 

chromatography techniques, including for instance, Protein A or protein G column chromatography. 

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 
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EXAMPLE 144 : Preparation of Antibodies that Bind PRO 

This example illustrates preparation of monoclonal antibodies which can specifically bind PRO. 

Techniques for producing the monoclonal antibodies are known in the art and are described, for 
instance, in Goding, supra . Immunogens that may be employed include purified PRO, fusion proteins containing 
PRO, and cells expressing recombinant PRO on the cell surface. Selection of the immunogen can be made by 
5 the skilled artisan without undue experimentation. 

Mice, such as Balb/c, are immunized with the PRO immunogen emulsified in complete Freund's 
adjuvant and injected subcutaneously or intraperitoneally in an amount from 1-100 micrograms. Alternatively, 
the immunogen is emulsified in MPL-TDM adjuvant (Ribi Immunochemical Research, Hamilton, MT) and 
injected into the animal's hind foot pads. The immunized mice are then boosted 10 to 12 days later with 
10 additional immunogen emulsified in the selected adjuvant. Thereafter, for several weeks, the mice may also be 
boosted with additional immunization injections. Serum samples may be periodically obtained from the mice 
by retro-orbital bleeding for testing in ELISA assays to detect anti-PRO antibodies. 

After a suitable antibody titer has been detected, the animals "positive" for antibodies can be injected 
with a final intravenous injection of PRO. Three to four days later, the mice are sacrificed and the spleen cells 
15 are harvested. The spleen cells are then fused (using 35% polyethylene glycol) to a selected murine myeloma 
cell line such as P3X63AgU.l, available from ATCC, No. CRL 1597. The fusions generate hybridoma cells 
which can then be plated in 96 well tissue culture plates containing HAT (hypoxanthine, aminopterin, and 
thymidine) medium to inhibit proliferation of non-fused cells, myeloma hybrids, and spleen cell hybrids. 

The hybridoma cells will be screened in an ELISA for reactivity against PRO. Determination of 
20 "positive" hybridoma cells secreting the desired monoclonal antibodies against PRO is within the skill in the art. 

The positive hybridoma cells can be injected intraperitoneally into syngeneic Balb/c mice to produce 
ascites containing the anti-PRO monoclonal antibodies. Alternatively, the hybridoma cells can be grown in tissue 
culture flasks or roller bottles. Purification of the monoclonal antibodies produced in the ascites can be 
accomplished using ammonium sulfate precipitation, followed by gel exclusion chromatography. Alternatively, 
25 affinity chromatography based upon binding of antibody to protein A or protein G can be employed. 

EXAMPLE 145 : Purification of PRO Polypeptides Using Specific Antibodies 

Native or recombinant PRO polypeptides may be purified by a variety of standard techniques in the art 

of protein purification. For example, pro-PRO polypeptide, mature PRO polypeptide, or pre-PRO polypeptide 
30 is purified by immunoaffmity chromatography using antibodies specific for the PRO polypeptide of interest. In 

general, an immunoaffinity column is constructed by covalently coupling the anti-PRO polypeptide antibody to 

an activated chromatographic resin. 

Polyclonal immunoglobulins are prepared from immune sera either by precipitation with ammonium 

sulfate or by purification on immobilized Protein A (Pharmacia LKB Biotechnology, Piscataway, N.J.). 
35 Likewise, monoclonal antibodies are prepared from mouse ascites fluid by ammonium sulfate precipitation or 

chromatography on immobilized Protein A. Partially purified immunoglobulin is covalently attached to a 

chromatographic resin such as CnBr-activated SEPH AROSE™ (Pharmacia LKB Biotechnology). The antibody 
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is coupled to the resin, the resin is blocked, and the derivative resin is washed according to the manufacturer's 
instructions. 

Such an immunoaffinity column is utilized in the purification of PRO polypeptide by preparing a fraction 
from cells containing PRO polypeptide in a soluble form. This preparation is derived by solubilization of the 
whole cell or of a subcellular fraction obtained via differential centrifugation by the addition of detergent or by 
5 other methods well known in the art. Alternatively, soluble PRO polypeptide containing a signal sequence may 
be secreted in useful quantity into the medium in which the cells are grown. 

A soluble PRO polypeptide-containing preparation is passed over the immunoaffinity column, and the 
column is washed under conditions that allow the preferential absorbance of PRO polypeptide {e.g. , high ionic 
strength buffers in the presence of detergent). Then, the column is eluted under conditions that disrupt 
10 antibody/PRO polypeptide binding (e.g. , a low pH buffer such as approximately pH 2-3, or a high concentration 
of a chaotrope such as urea or thiocyanate ion), and PRO polypeptide is collected. 

EXAMPLE 146: Drue Screening 

This invention is particularly useful for screening compounds by using PRO polypeptides or binding 

15 fragment thereof in any of a variety of drug screening techniques. The PRO polypeptide or fragment employed 
in such a test may either be free in solution, affixed to a solid support, borne on a cell surface, or located 
intracellularly. One method of drug screening utilizes eukaryotic or prokaryotic host cells which are stably 
transformed with recombinant nucleic acids expressing the PRO polypeptide or fragment. Drugs are screened 
against such transformed cells in competitive binding assays. Such cells, either in viable or fixed form, can be 

20 used for standard binding assays. One may measure, for example, the formation of complexes between PRO 
polypeptide or a fragment and the agent being tested. Alternatively, one can examine the diminution in complex 
formation between the PRO polypeptide and its target cell or target receptors caused by the agent being tested. 

Thus, the present invention provides methods of screening for drugs or any other agents which can 
affect a PRO polypeptide-associated disease or disorder. These methods comprise contacting such an agent with 

25 an PRO polypeptide or fragment thereof and assaying (I) for the presence of a complex between the agent and 
the PRO polypeptide or fragment, or (ii) for the presence of a complex between the PRO polypeptide or fragment 
and the cell, by methods well known in the art. In such competitive binding assays, the PRO polypeptide or 
fragment is typically labeled. After suitable incubation, free PRO polypeptide or fragment is separated from that 
present in bound form, and the amount of free or uncomplexed label is a measure of the ability of the particular 

30 agent to bind to PRO polypeptide or to interfere with the PRO polypepude/cell complex. 

Another technique for drug screening provides high throughput screening for compounds having suitable 
binding affinity to a polypeptide and is described in detail in WO 84/03564, published on September 13, 1984. 
Briefly stated, large numbers of different small peptide test compounds are synthesized on a solid substrate, such 
as plastic pins or some other surface. As applied to a PRO polypeptide, the peptide test compounds are reacted 

35 with PRO polypeptide and washed. Bound PRO polypeptide is detected by methods well known in the art. 
Purified PRO polypeptide can also be coated direcdy onto plates for use in the aforementioned drug screening 
techniques. In addition, non-neutralizing antibodies can be used to capture the peptide and immobilize it on the 
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solid support. 

This invention also contemplates the use of competitive drug screening assays in which neutralizing 
antibodies capable of binding PRO polypeptide specifically compete with a test compound for binding to PRO 
polypeptide or fragments thereof. In this manner, the antibodies can be used to detect the presence of any 
peptide which shares one or more antigenic determinants with PRO polypeptide. 

5 

EXAMPLE 147: Rational Drue Design 

The goal of rational drug design is to produce structural analogs of biologically active polypeptide of 
interest (i.e. , a PRO polypeptide) or of small molecules with which they interact, e.g. , agonists, antagonists, or 
inhibitors. Any of these examples can be used to fashion drugs which are more active or stable forms of the 

10 PRO polypeptide or which enhance or interfere with the function of the PRO polypeptide in vivo (c./. , Hodgson, 
Bio/Technology. & 19-21 (1991)). 

In one approach, the three-dimensional structure of the PRO polypeptide, or of an PRO 
polypeptide-inhibitor complex, is determined by x-ray crystallography, by computer modeling or, most typically, 
by a combination of the two approaches. Both the shape and charges of the PRO polypeptide must be ascertained 

15 to elucidate the structure and to determine active site(s) of the molecule. Less often, useful information regarding 
the structure of the PRO polypeptide may be gained by modeling based on the structure of homologous proteins. 
In both cases, relevant structural information is used to design analogous PRO polypeptide-like molecules or to 
identify efficient inhibitors. Useful examples of rational drug design may include molecules which have improved 
activity or stability as shown by Braxton and Wells, Biochemistry. 31:7796-7801 (1992) or which act as 

20 inhibitors, agonists, or antagonists of native peptides as shown by Athauda et a/., J. Biochem., JJl:742-746 
(1993). 

It is also possible to isolate a target-specific antibody, selected by functional assay, as described above, 
and then to solve its crystal structure. This approach, in principle, yields a pharmacore upon which subsequent 
drug design can be based. It is possible to bypass protein crystallography altogether by generating anti-idiotypic 

25 antibodies (anti-ids) to a functional, pharmacologically active antibody. As a minor image of a mirror image, 
the binding site of the anti-ids would be expected to be an analog of the original receptor. The anti-id could then 
be used to identify and isolate peptides from banks of chemically or biologically produced peptides. The isolated 
peptides would then act as the pharmacore. 

By virtue of the present invention, sufficient amounts of the PRO polypeptide may be made available 

30 to perform such analytical studies as X-ray crystallography. In addition, knowledge of the PRO polypeptide 
amino acid sequence provided herein will provide guidance to those employing computer modeling techniques 
in place of or in addition to x-ray crystallography. 

35 
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Deposit of Material 

The following materials have been deposited with the American Type Culture Collection, 10801 
University Blvd., Manassas, VA 201 10-2209, USA (ATCC): 



Table 2 



10 



15 



Iridic Mai 




ueposii u 


DNA16422-1209 


209929 


June 2, 1998 


DNA16435-1208 


209930 


June 2, 1998 


DNA21624-1391 


209917 


June 2, 1998 


DNA23334-1392 


209918 


June 2, 1998 


DNA26288-1239 


209792 


April 21, 1998 


DNA26843-1389 


203099 


August 4, 1998 


DNA26844-1394 


209926 


June 2, 1998 


DNA30862-1396 


209920 


June 2, 1998 


DNA35680-1212 


209790 


April 21, 1998 


DNA40621-1440 


209922 


June 2, 1998 


DNA44161-1434 


209907 


May 27, 1998 


DNA44694-1500 


203114 


August 11, 1998 


DNA45495-1550 


203156 


August 25, 1998 


DNA47361-1154 


209431 


November 7, 1997 


DNA47394-1572 


203109 


August 11, 1998 


DNA48320-1433 


209904 


May 27, 1998 


DNA48334-1435 


209924 


June 2, 1998 


DNA48606-1479 


203040 


July 1, 1998 


DNA49141-1431 


203003 


June 23, 1998 


DNA49142-1430 


203002 


June 23, 1998 


DNA49143-1429 


203013 


June 23, 1998 


DNA49647-1398 


209919 


June 2, 1998 


DNA49819-1439 


209931 


June 2, 1998 


DNA49820-1427 


209932 


June 2, 1998 


DNA49821-1562 


209981 


June 16, 1998 


DNA52192-1369 


203042 


July 1, 1998 


DNA52598-1518 


203107 


August 11, 1998 


DNA53913-1490 


203162 


August 25, 1998 


DNA53978-1443 


209983 


June 16, 1998 


DNA53996-1442 


209921 


June 2, 1998 


DNA56041-1416 


203012 


June 23, 1998 


DNA56047-1456 


209948 


June 9, 1998 


DNA56050-1455 


203011 


June 23, 1998 


DNA561 10-1437 


203113 


August 11, 1998 


DNA561 13-1378 


203049 


July 1, 1998 


DNA56410-1414 


209923 


June 2, 1998 


DNA56436-1448 


209902 


May 27, 1998 


DNA56855-1447 


203004 


June 23, 1998 


DNA56859-1445 


203019 


June 23, 1998 


DNA56860-1510 


209952 


June 9, 1998 


DNA56865-1491 


203022 


June 23, 1998 


DNA56866-1342 


203023 


June 23, 1998 


DNA56868-1209 


203024 


June 23, 1998 


DNA56869-1545 


203161 


August 25, 1998 


DNA56870-1492 


209925 


June 2, 1998 


DNA57033-1403 


209905 


May 27, 1998 


DNA57037-1444 


209903 


May 27, 1998 


DNA57129-1413 


209977 


June 16, 1998 
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DNA57690-1374 
DNA57693-1424 
DNA57694-1341 
DNA57695-1340 
DNA57699-1412 
5 DNA57702-1476 
DNA57704-1452 
DNA57708-1411 
DNA57710-1451 
DNA5771 1-1501 

10 DNA57827-1493 
DNA57834-1339 
DNA57836-1338 
DNA57838-1337 
DNA57844-1410 

15 DNA58721-1475 
DNA58723-1588 
DNA58737-1473 
DNA58743-1609 
DNA58846-1409 

20 DNA58848-1472 
DNA58849-1494 
DNA58850-1495 
DNA58853-1423 
DNA58855-1422 

25 DNA59205-1421 
DNA5921 1-1450 
DNA59213-1487 
DNA59214-1449 
DNA59215-1425 

30 DNA59220-1514 
DNA59488-1603 
DNA59493-1420 
DNA59497-1496 
DNA59588-1571 

35 DNA59603-1419 
DNA59605-1418 
DNA59606-1471 
DNA59607-1497 
DNA59609-1470 

40 DNA59610-1559 
DNA59612-1466 
DNA59613-1417 
DNA59616-1465 
DNA59619-1464 

45 DNA59620-1463 
DNA59625-1498 
DNA59767-1489 
DNA59776-1600 
DNA59777-1480 

50 DNA59820-1549 
DNA59827-1426 
DNA59828-1608 
DNA59838-1462 
DNA59839-1461 

55 DNA5984 1-1460 
DNA59842-1502 



209950 

203008 

203017 

203006 

203020 

209951 

209953 

203021 

203048 

203047 

203045 

209954 

203025 

203014 

203010 

203110 

203133 

203136 

203154 

209957 

209955 

209958 

209956 

203016 

203018 

203009 

209960 

209959 

203046 

209961 

209962 

203157 

203050 

209941 

203106 

209944 

203005 

209945 

209957 

209963 

209990 

209947 

203007 

209991 

203041 

209989 

209992 

203108 

203128 

203111 

203129 

203089 

203158 

209976 

209988 

203044 

209982 



June 9, 1998 
June 23, 1998 
June 23, 1998 
June 23, 1998 
June 23, 1998 
June 9, 1998 
June 9, 1998 
June 23, 1998 
July 1, 1998 
July 1, 1998 
July 1, 1998 
June 9, 1998 
June 23, 1998 
June 23, 1998 
June 23, 1998 
August 11, 1998 
August 18, 1998 
August 18, 1998 
August 25, 1998 
June 9, 1998 
June 9, 1998 
June 9, 1998 
June 9, 1998 
June 23, 1998 
June 23, 1998 
June 23, 1998 
June 9, 1998 
June 9, 1998 
July 1, 1998 
June 9, 1998 
June 9, 1998 
August 25, 1998 
July 1, 1998 
June 4, 1998 
August 11, 1998 
June 9, 1998 
June 23, 1998 
June 9, 1998 
June 9, 1998 
June 9, 1998 
June 16, 1998 
June 9, 1998 
June 23, 1998 
June 16, 1998 
July 1, 1998 
June 16, 1998 
June 17, 1998 
August 11, 1998 
August 18, 1998 
August 11, 1998 
August 18, 1998 
August 4, 1998 
August 25, 1998 
June 16, 1998 
June 16, 1998 
July 1, 1998 
June 16, 1998 
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10 



15 



20 



25 



30 



35 



40 



45 



DNA5QR46-1501 


209978 


limp 16 lOOfi 


niMA50847-151 1 


7010Q8 


Anmicf A lOOO 

AUgUSl e * t I Wo 


DNAS9R4R-1517 


201088 


Ancmct A 1 0Ofi 
AUgUSl ISriro 


DNA59849-1504 


209986 


June 16 1QQ8 


DNA59853-1505 


209985 


June 16 1008 

1U, 1770 




200074 

tu/7 / *T 


Tnnp 16 1008 
Julie 10, lyyo 


DNA60283-1484 


203043 


Julv 1 1008 


DNA60615-1481 


209980 


Tnnf> 16 1008 

June io, iyyo 


DNA60619-1482 


209993 


Jimp 16 1008 

JUlIC IO, 1770 


DNA60621-1516 


203091 


Ancnict A 1008 
/\ugUM h, iyyo 


DNA60622-1525 




AUgUM *f , I770 


DNA60625-1507 


209975 


lime 16 1008 


DNA60627-1508 


203092 


A nonet A 1QQ8 
/\UgUM H, lyyo 


DNA60629-1481 


20QQ7Q 


Tn np 16 10Q8 

JUIlC IO, I77O 


DNA6 1755- 1554 


201 1 1 7 


Atimiet 1 1 1 QQQ 

Augusi yyo 


HNA61R71-1S74 


701 1 17 ' 


August 10, iyy<5 


DNA62814-1521 


701001 


Aumict A I OO C 

August iyvo 


DNA62872-1509 


701100 


Aumict A 1 OQO 

August lyvo 


DNA 62876-1 517 


701005 


AUgUSl lyVo 


nNA628Rl-151S 


701006 


AUgUSl 4, IWo 


DNA64R52-1589 

l^llAWOJi 1J07 


701177 


Anmict t & 1 QOQ 

August io, lyy© 


nNA64RR4-1577 


7011 


August zj, lyyo 


nNA64RQ0_l617 


701 111 

ZUj IjI 


August 10, lyyo 


DNA65412-1523 


203094 


August 4 1 998 


DNA66308-1537 


203159 


August 25, 1998 


DNA66309-1538 


203235 


September 15, 1998 


DNA67004-1614 


203115 


August 11, 1998 


DNA68869-1610 


203164 


August 25, 1998 


DNA68872-1620 


203160 


August 25, 1998 


DNA71 159-1617 


203135 


August 18, 1998 



These deposit were made under the provisions of the Budapest Treaty on the International Recognition 
of the Deposit of Microorganisms for the Purpose of Patent Procedure and the Regulations thereunder (Budapest 
Treaty). This assures maintenance of a viable culture of the deposit for 30 years from the date of deposit. The 
deposits will be made available by ATCC under the terms of the Budapest Treaty, and subject to an agreement 
between Genentech, Inc. and ATCC, which assures permanent and unrestricted availability of the progeny of 
the culture of the deposit to the public upon issuance of the pertinent U.S. patent or upon laying open to the 
public of any U.S. or foreign patent application, whichever comes first, and assures availability of the progeny 
to one determined by the U.S. Commissioner of Patents and Trademarks to be entitled thereto according to 35 
USC §122 and the Commissioner's rules pursuant thereto (including 37 CFR §1 . 14 with particular reference to 
886 OG 638). 

The assignee of the present application has agreed that if a culture of the materials on deposit should 
die or be lost or destroyed when cultivated under suitable conditions, the materials will be promptly replaced on 
notification with another of the same. Availability of the deposited material is not to be construed as a license 
to practice the invention in contravention of the rights granted under the authority of any government in 
accordance with its patent laws. 

The foregoing written specification is considered to be sufficient to enable one skilled in the art to 
practice the invention. The present invention is not to be limited in scope by the construct deposited, since the 
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deposited embodiment is intended as a single illustration of certain aspects of the invention and any constructs 
that are functionally equivalent are within the scope of this invention. The deposit of material herein does not 
constitute an admission that the written description herein contained is inadequate to enable the practice of any 
aspect of the invention, including the best mode thereof, nor is it to be construed as limiting the scope of the 
claims to the specific illustrations that it represents. Indeed, various modifications of the invention in addition 
to those shown and described herein will become apparent to those skilled in the art from the foregoing 
description and fall within the scope of the appended claims. 
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WHAT IS CLAIMED IS : 

1. Isolated nucleic acid having at least 80% sequence identity to a nucleotide sequence that 
encodes a polypeptide comprising an amino acid sequence selected from the group consisting of the amino acid 
sequence shown in Figure 2 (SEQ ID NO:2), Figure 4 (SEQ ID NO:6), Figure 6 (SEQ ID NO:8), Figure 9 
(SEQ ID NO: 14), Figure 12 (SEQ ID NO:20), Figure 15 (SEQ ID NO:23), Figure 18 (SEQ ID NO:28), Figure 
5 20 (SEQ ID NO:30), Figure 23 (SEQ ID NO:33), Figure 25 (SEQ ID NO:36), Figure 27 (SEQ ID NO:41), 
Figure 30 (SEQ ID NO:47), Figure 32 (SEQ ID NO:52), Figure 34 (SEQ ID NO:57), Figure 36 (SEQ ID 
NO:62), Figure 38 (SEQ ID NO:67), Figure 41 (SEQ ID NO:73), Figure 47 (SEQ ID NO:84), Figure 49 (SEQ 
ID NO:95), Figure 51 (SEQ ID NO:97), Figure 53 (SEQ ID NO:99), Figure 57 (SEQ ID NO: 103), Figure 64 
(SEQ ID NO: 1 13), Figure 66 (SEQ ID NO: 1 15), Figure 68 (SEQ ID NO: 1 17), Figure 70 (SEQ ID NO: 1 19), 

10 Figure 72 (SEQ ID NO: 124), Figure 74 (SEQ ID NO: 129), Figure 76 (SEQ ID NO: 135), Figure 79 (SEQ ID 
NO: 138), Figure 83 (SEQ ID NO: 146), Figure 85 (SEQ ID NO: 148), Figure 88 (SEQ ID NO: 151), Figure 90 
(SEQ ID NO: 153), Figure 93 (SEQ ID NO: 156), Figure 95 (SEQ ID NO: 158), Figure 97 (SEQ ID NO: 160), 
Figure 99 (SEQ ID NO: 165), Figure 101 (SEQ ID NO: 167), Figure 103 (SEQ ID NO: 169), Figure 105 (SEQ 
ID NO: 171), Figure 109 (SEQ ID NO: 175), Figure 111 (SEQ ID NO: 177), Figure 113 (SEQ ID NO: 179), 

15 Figure 1 15 (SEQ ID NO: 181), Figure 1 17 (SEQ ID NO: 183), Figure 120 (SEQ ID NO: 189), Figure 122 (SEQ 
ID NO:194), Figure 125 (SEQ ID NO:197), Figure 127 (SEQ ID NO: 199), Figure 129 (SEQ ID NO:201), 
Figure 131 (SEQ ID NO:203), Figure 133 (SEQ ID NO:205), Figure 135 (SEQ ID NO:207), Figure 137 (SEQ 
ID NO:209), Figure 139 (SEQ ID NO:211), Figure 141 (SEQ ID NO:213), Figure 144 (SEQ ID NO:216), 
Figure 147 (SEQ ID NO:219), Figure 149 (SEQ ID NO:221), Figure 151 (SEQ ID NO:223), Figure 153 (SEQ 

20 ID NO:225), Figure 155 (SEQ ID NO:227), Figure 157 (SEQ ID NO:229), Figure 159 (SEQ ID NO:23I), 
Figure 161 (SEQ ID NO:236), Figure 163 (SEQ ID NO:241), Figure 165 (SEQ ID NO:246), Figure 167 (SEQ 
ID NO:248), Figure 169 (SEQ ID NO:250), Figure 171 (SEQ ID NO:253), Figure 174 (SEQ ID NO:256), 
Figure 176 (SEQ ID NO:258), Figure 178 (SEQ ID NO:260), Figure 180 (SEQ ID NO:262), Figure 182 (SEQ 
ID NO:264), Figure 184 (SEQ ID NO:266), Figure 186 (SEQ ID NO:268), Figure 188 (SEQ ID NO:270), 

25 Figure 190 (SEQ ID NO:272), Figure 192 (SEQ ID NO:274), Figure 194 (SEQ ID NO:276), Figure 196 (SEQ 
ID NO:278), Figure 198 (SEQ ID NO:281), Figure 200 (SEQ ID NO:283), Figure 202 (SEQ ID NO:285), 
Figure 204 (SEQ ID NO:287), Figure 206 (SEQ ID NO:289), Figure 208 (SEQ ID NO:291), Figure 210 (SEQ 
ID NO:293), Figure 212 (SEQ ID NO:295), Figure 214 (SEQ ID NO:297), Figure 216 (SEQ ID NO:299), 
Figure 218 (SEQ ID NO:301), Figure 220 (SEQ ID NO:303), Figure 226 (SEQ ID NO:309), Figure 228 (SEQ 

30 ID NO:314), Figure 230 (SEQ ID NO:319), Figure 233 (SEQ ID NO:326), Figure 235 (SEQ ID NO:334), 
Figure 238 (SEQ ID NO:340), Figure 240 (SEQ ID NO:345), Figure 242 (SEQ ID NO:347), Figure 244 (SEQ 
ID NO:349), Figure 246 (SEQ ID NO:351), Figure 248 (SEQ ID NO:353), Figure 250 (SEQ ID NO:355), 
Figure 252 (SEQ ID NO:357), Figure 254 (SEQ ID NO:359), Figure 256 (SEQ ID NO:361), Figure 258 (SEQ 
ID NO:363), Figure 260 (SEQ ID NO:365), Figure 262 (SEQ ID NO:367), Figure 264 (SEQ ID NO:369), 

35 Figure 266 (SEQ ID NO:371), Figure 268 (SEQ ID NO:373), Figure 270 (SEQ ID NO:375), Figure 272 (SEQ 
ID NO:377), Figure 274 (SEQ ID NO:379), Figure 276 (SEQ ID NO:381), Figure 278 (SEQ ID NO:387), 
Figure 280 (SEQ ID NO:389), Figure 282 (SEQ ID NO:394), Figure 284 (SEQ ID NO:399), Figure 286 (SEQ 
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ID NO:401), Figure 288 (SEQ ID NO:403), Figure 290 (SEQ ID NO:408), Figure 292 (SEQ ID NO:410), 
Figure 294 (SEQ ID NO:412), Figure 296 (SEQ ID NO:414), Figure 298 (SEQ ID NO:416), Figure 300 (SEQ 
ID NO:418), Figure 302 (SEQ ID NO:420), Figure 304 (SEQ ID NO:422) and Figure 306 (SEQ ID NO:424). 

2. The nucleic acid sequence of Claim 1 , wherein said nucleotide sequence comprises a nucleotide 
5 sequence selected from the group consisting of the sequence shown in Figure 1 (SEQ ID NO: 1), Figure 3 (SEQ 
ID NO:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO: 13), Figure 11 (SEQ ID NO: 19), Figure 14 (SEQ 
ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29), Figure 22 (SEQ ID NO:32), Figure 24 
(SEQ ID NO:35), Figure 26 (SEQ ID NO:40), Figure 29 (SEQ ID NO:46), Figure 31 (SEQ ID NO:51), Figure 
33 (SEQ ID NO:56), Figure 35 (SEQ ID NO:61), Figure 37 (SEQ ID NO:66), Figure 40 (SEQ ID NO:72), 

10 Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94), Figure 50 (SEQ ID NO:96), Figure 52 (SEQ ID 
NO:98), Figure 56 (SEQ ID NO: 102), Figure 63 (SEQ ID NO: 1 12), Figure 65 (SEQ ID NO: 1 14), Figure 67 
(SEQ ID NO: 1 16), Figure 69 (SEQ ID NO: 118), Figure 71 (SEQ ID NO: 123), Figure 73 (SEQ ID NO: 128), 
Figure 75 (SEQ ID NO: 134), Figure 78 (SEQ ID NO: 137), Figure 82 (SEQ ID NO: 145), Figure 84 (SEQ ID 
NO:147), Figure 87 (SEQ ID NO:150), Figure 89 (SEQ ID NO:152), Figure 92 (SEQ ID NO:155), Figure 94 

15 (SEQ ID NO: 157), Figure 96 (SEQ ID NO: 159), Figure 98 (SEQ ID NO: 164), Figure 100 (SEQ ID NO: 166), 
Figure 102 (SEQ ID NO: 168), Figure 104 (SEQ ID NO: 170), Figure 108 (SEQ ID NO: 174), Figure 1 10 (SEQ 
ID NO: 176), Figure 112 (SEQ ID NO: 178), Figure 114 (SEQ ID NO: 180), Figure 116 (SEQ ID NO: 182), 
Figure 119 (SEQ ID NO:188), Figure 121 (SEQ ID NO:193), Figure 124 (SEQ ID NO:196), Figure 126 (SEQ 
ID NO: 198), Figure 128 (SEQ ID NO:200), Figure 130 (SEQ ID NO:202), Figure 132 (SEQ ID NO:204), 

20 Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID NO:208), Figure 138 (SEQ ID NO:210), Figure 140 (SEQ 
ID NO:212), Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure 148 (SEQ ID NO:220), 
Figure 150 (SEQ ID NO:222), Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID NO:226), Figure 156 (SEQ 
ID NO:228), Figure 158 (SEQ ID NO:230), Figure 160 (SEQ ID NO:235), Figure 162 (SEQ ID NO:240), 
Figure 164 (SEQ ID NO:245), Figure 166 (SEQ ID NO:247), Figure 168 (SEQ ID NO:249), Figure 170 (SEQ 

25 ID NO:252), Figure 173 (SEQ ID NO:255), Figure 175 (SEQ ID NO:257), Figure 177 (SEQ ID NO:259), 
Figure 179 (SEQ ID NO:261), Figure 181 (SEQ ID NO:263), Figure 183 (SEQ ID NO:265), Figure 185 (SEQ 
ID NO:267), Figure 187 (SEQ ID NO:269), Figure 189 (SEQ ID NO:271), Figure 191 (SEQ ID NO:273), 
Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID NO:280), Figure 199 (SEQ 
ID NO:282), Figure 201 (SEQ ID NO:284), Figure 203 (SEQ ID NO:286), Figure 205 (SEQ ID NO:288), 

30 Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID NO:292), Figure 211 (SEQ ID NO:294), Figure 213 (SEQ 
ID NO:296), Figure 215 (SEQ ID NO:298), Figure 217 (SEQ ID NO:300), Figure 219 (SEQ ID NO:302), 
Figure 225 (SEQ ID NO:308), Figure 227 (SEQ ID NO:313), Figure 229 (SEQ ID NO:318), Figure 232 (SEQ 
ID NO:325), Figure 234 (SEQ ID NO:333), Figure 237 (SEQ ID NO:339), Figure 239 (SEQ ID NO:344), 
Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID NO:348), Figure 245 (SEQ ID NO:350), Figure 247 (SEQ 

35 ID NO:352), Figure 249 (SEQ ID NO:354), Figure 251 (SEQ ID NO:356), Figure 253 (SEQ ID NO:358), 
Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID NO:362), Figure 259 (SEQ ID NO:364), Figure 261 (SEQ 
ID NO:366), Figure 263 (SEQ ID NO:368), Figure 265 (SEQ ID NO:370), Figure 267 (SEQ ID NO:372), 
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Figure 269 (SEQ ID NO:374), Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID NO:378), Figure 275 (SEQ 
ID NO:380), Figure 277 (SEQ ID NO:386), Figure 279 (SEQ ID NO:388), Figure 281 (SEQ ID NO:393), 
Figure 283 (SEQ ID NO:398), Figure 285 (SEQ ID NO:400), Figure 287 (SEQ ID NO:402), Figure 289 (SEQ 
ID NO:407), Figure 291 (SEQ ID NO:409), Figure 293 (SEQ ID N0:411), Figure 295 (SEQ ID NO:413), 
Figure 297 (SEQ ID NO:415), Figure 299 (SEQ ID NO:417), Figure 301 (SEQ ID NO:419), Figure 303 (SEQ 
5 ID NO:421) and Figure 305 (SEQ ID NO:423). 

3 . The nucleic acid of Claim 1 , wherein said nucleotide sequence comprises a nucleotide sequence 
selected from the group consisting of the full-length coding sequence of the sequence shown in Figure 1 (SEQ 
ID NO:l), Figure 3 (SEQ ID NO:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO: 13), Figure 11 (SEQ ID 

10 NO: 19), Figure 14 (SEQ ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29), Figure 22 (SEQ 
ID NO:32), Figure 24 (SEQ ID NO:35), Figure 26 (SEQ ID NO:40), Figure 29 (SEQ ID NO:46), Figure 31 
(SEQ ID NO:51 j, Figure 33 (SEQ ID NO:56), Figure 35 (SEQ ID NO:61), Figure 37 (SEQ ID NO:66), Figure 
40 (SEQ ID NO:72), Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94), Figure 50 (SEQ ID NO:96), 
Figure 52 (SEQ ID NO:98), Figure 56 (SEQ ID NO: 102), Figure 63 (SEQ ID NO: 112), Figure 65 (SEQ ID 

15 NO:114), Figure 67 (SEQ ID NO:116), Figure 69 (SEQ ID NO:118), Figure 71 (SEQ ID NO:123), Figure 73 
(SEQ ID NO: 128), Figure 75 (SEQ ID NO: 134), Figure 78 (SEQ ID NO: 137), Figure 82 (SEQ ID NO: 145), 
Figure 84 (SEQ ID NO: 147), Figure 87 (SEQ ID NO: 150), Figure 89 (SEQ ID NO: 152), Figure 92 (SEQ ID 
NO: 155), Figure 94 (SEQ ID NO: 157), Figure 96 (SEQ ID NO: 159), Figure 98 (SEQ ID NO: 164), Figure 100 
(SEQ ID NO: 166), Figure 102 (SEQ ID NO: 168), Figure 104 (SEQ ID NO: 170), Figure 108 (SEQ ID 

20 NO: 174), Figure 110 (SEQ ID NO: 176), Figure 112 (SEQ ID NO: 178), Figure 114 (SEQ ID NO:180), Figure 
116 (SEQ ID NO: 182), Figure 119 (SEQ ID NO: 188), Figure 121 (SEQ ID NO: 193), Figure 124 (SEQ ID 
NO: 196), Figure 126 (SEQ ID NO: 198), Figure 128 (SEQ ID NO:200), Figure 130 (SEQ ID NO:202), Figure 
132 (SEQ ID NO:2Q4), Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID NO:208), Figure 138 (SEQ ID 
NO:210), Figure 140 (SEQ ID NO:212), Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure 

25 148 (SEQ ID NO:220), Figure 150 (SEQ ID NO:222), Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID 
NO:226), Figure 156 (SEQ ID NO:228), Figure 158 (SEQ ID NO:230), Figure 160 (SEQ ID NO:235), Figure 
162 (SEQ ID NO:240), Figure 164 (SEQ ID NO:245), Figure 166 (SEQ ID NO:247), Figure 168 (SEQ ID 
NO:249), Figure 170 (SEQ ID NO:252), Figure 173 (SEQ ID NO:255), Figure 175 (SEQ ID NO:257), Figure 
177 (SEQ ID NO:259), Figure 179 (SEQ ID NO:26I), Figure 181 (SEQ ID NO:263), Figure 183 (SEQ ID 

30 NO:265), Figure 185 (SEQ ID NO:267), Figure 187 (SEQ ID NO:269), Figure 189 (SEQ ID NO:271), Figure 
191 (SEQ ID NO:273), Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID 
NO:280), Figure 199 (SEQ ID NO:282), Figure 201 (SEQ ID NO:284), Figure 203 (SEQ ID NO:286), Figure 
205 (SEQ ID NO:288), Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID NO:292), Figure 211 (SEQ ID 
NO:294), Figure 213 (SEQ ID NO:296), Figure 215 (SEQ ID NO:298), Figure 217 (SEQ ID NO:300), Figure 

35 219 (SEQ ID NO:302), Figure 225 (SEQ ID NO:308), Figure 227 (SEQ ID NO:313), Figure 229 (SEQ ID 
NO:318), Figure 232 (SEQ ID NO:325), Figure 234 (SEQ ID NO:333), Figure 237 (SEQ ID NO:339), Figure 
239 (SEQ ID NO:344), Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID NO:348), Figure 245 (SEQ ID 
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NO:350), Figure 247 (SEQ ID NO:352), Figure 249 (SEQ ID NO:354), Figure 251 (SEQ ID NO:356), Figure 
253 (SEQ ID NO:358), Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID NO:362), Figure 259 (SEQ ID 
NO:364), Figure 261 (SEQ ID NO:366), Figure 263 (SEQ ID NO:368), Figure 265 (SEQ ID NO:370), Figure 
267 (SEQ ID NO:372), Figure 269 (SEQ ID NO:374), Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID 
NO:378), Figure 275 (SEQ ID NO:380), Figure 277 (SEQ ID NO:386), Figure 279 (SEQ ID NO:388), Figure 
5 281 (SEQ ID NO:393), Figure 283 (SEQ ID NO:398), Figure 285 (SEQ ID NO:400), Figure 287 (SEQ ID 
NO:402), Figure 289 (SEQ ID NO:407), Figure 291 (SEQ ID NO:409), Figure 293 (SEQ ID N0:41 1), Figure 
295 (SEQ ID NO:413), Figure 297 (SEQ ID NO:415), Figure 299 (SEQ ID NO:417), Figure 301 (SEQ ID 
NO:419), Figure 303 (SEQ ID NO:421) or Figure 305 (SEQ ID NO:423). 

10 . 4. Isolated nucleic acid which comprises the full-length coding sequence of the DNA deposited 

under any ATCC accession number shown in Table 2. 

5. A vector comprising the nucleic acid of Claim 1 . 

15 6. The vector of Claim 5 operably linked to control sequences recognized by a host cell 

transformed with the vector. 

7. A host cell comprising the vector of Claim 5. 

20 8. The host cell of Claim 7 wherein said cell is a CHO cell. 

9. The host cell of Claim 7 wherein said cell is an E. colL 



25 



10. The host cell of Claim 7 wherein said cell is a yeast cell. 

11. A process for producing a PRO polypeptides comprising culturing the host cell of Claim 7 
under conditions suitable for expression of said PRO polypeptide and recovering said PRO polypeptide from the 
cell culture. 

30 12. Isolated PRO polypeptide having at least 80% sequence identity to an amino acid sequence 

selected from the group consisting of the amino acid sequence shown in Figure 2 (SEQ ID NO:2), Figure 4 (SEQ 
ID NO:6), Figure 6 (SEQ ID NO:8), Figure 9 (SEQ ID NO: 14), Figure 12 (SEQ ID NO:20), Figure 15 (SEQ 
ID NO:23), Figure 18 (SEQ ID NO:28), Figure 20 (SEQ ID NO:30), Figure 23 (SEQ ID NO:33), Figure 25 
(SEQ ID NO:36), Figure 27 (SEQ ID NO:41), Figure 30 (SEQ ID NO:47), Figure 32 (SEQ ID NO:52), Figure 

35 34 (SEQ ID NO:57), Figure 36 (SEQ ID NO:62), Figure 38 (SEQ ID NO:67), Figure 41 (SEQ ID NO:73), 
Figure 47 (SEQ ID NO:84), Figure 49 (SEQ ID NO:95), Figure 51 (SEQ ID NO:97), Figure 53 (SEQ ID 
NO:99), Figure 57 (SEQ ID NO: 103), Figure 64 (SEQ ID NO: 1 13), Figure 66 (SEQ ID NO: 1 15), Figure 68 
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(SEQ ID NO:117), Figure 70 (SEQ ID NO:119), Figure 72 (SEQ ID NO:124), Figure 74 (SEQ ID NO: 129), 
Figure 76 (SEQ ID NO: 135), Figure 79 (SEQ ID NO: 138), Figure 83 (SEQ ID NO: 146), Figure 85 (SEQ ID 
NO: 148), Figure 88 (SEQ ID NO: 151), Figure 90 (SEQ ID NO: 153), Figure 93 (SEQ ID NO: 156), Figure 95 
(SEQ ID NO: 158), Figure 97 (SEQ ID NO: 160), Figure 99 (SEQ ID NO: 165), Figure 101 (SEQ ID NO: 167), 
Figure 103 (SEQ ID NO:169), Figure 105 (SEQ ID NO:171), Figure 109 (SEQ ID NO: 175), Figure 111 (SEQ 
5 ID NO: 177), Figure 113 (SEQ ID NO: 179), Figure 115 (SEQ ID NO: 181), Figure 117 (SEQ ID NO: 183), 
Figure 120 (SEQ ID NO: 189), Figure 122 (SEQ ID NO: 194), Figure 125 (SEQ ID NO: 197), Figure 127 (SEQ 
ID NO: 199), Figure 129 (SEQ ID NO:201), Figure 131 (SEQ ID NO:203), Figure 133 (SEQ ID NO:205), 
Figure 135 (SEQ ID NO:207), Figure 137 (SEQ ID NO:209), Figure 139 (SEQ ID NO:21 1), Figure 141 (SEQ 
ID NO:213), Figure 144 (SEQ ID NO:216), Figure 147 (SEQ ID NO:219), Figure 149 (SEQ ID NO:221), 

10 Figure 151 (SEQ ID NO:223), Figure 153 (SEQ ID NO:225), Figure 155 (SEQ ID NO:227), Figure 157 (SEQ 
ID NO:229), Figure 159 (SEQ ID NO:231), Figure 161 (SEQ ID NO:236), Figure 163 (SEQ ID NO:241), 
Figure 165 (SEQ ID NO:246), Figure 167 (SEQ ID NO:248), Figure 169 (SEQ ID NO:250), Figure 171 (SEQ 
ID NO:253), Figure 174 (SEQ ID NO:256), Figure 176 (SEQ ID NO:258), Figure 178 (SEQ ID NO:260), 
Figure 180 (SEQ ID NO:262), Figure 182 (SEQ ID NO:264), Figure 184 (SEQ ID NO:266), Figure 186 (SEQ 

15 ID NO:268), Figure 188 (SEQ ID NO:270), Figure 190 (SEQ ID NO:272), Figure 192 (SEQ ID NO:274), 
Figure 194 (SEQ ID NO:276), Figure 196 (SEQ ID NO:278), Figure 198 (SEQ ID NO:281), Figure 200 (SEQ 
ID NO:283), Figure 202 (SEQ ID NO:285), Figure 204 (SEQ ID NO:287), Figure 206 (SEQ ID NO:289), 
Figure 208 (SEQ ID NO:291), Figure 210 (SEQ ID NO:293), Figure 212 (SEQ ID NO:295), Figure 214 (SEQ 
ID NO:297), Figure 216 (SEQ ID NO:299), Figure 218 (SEQ ID NO:301), Figure 220 (SEQ ID NO:303), 

20 Figure 226 (SEQ ID NO:309), Figure 228 (SEQ ID NO:314), Figure 230 (SEQ ID NO:319), Figure 233 (SEQ 
ID NO:326), Figure 235 (SEQ ID NO:334), Figure 238 (SEQ ID NO:340), Figure 240 (SEQ ID NO:345), 
Figure 242 (SEQ ID NO:347), Figure 244 (SEQ ID NO:349), Figure 246 (SEQ ID NO:351), Figure 248 (SEQ 
ID NO:353), Figure 250 (SEQ ID NO:355), Figure 252 (SEQ ID NO:357), Figure 254 (SEQ ID NO:359), 
Figure 256 (SEQ ID NO:361), Figure 258 (SEQ ID NO:363), Figure 260 (SEQ ID NO:365), Figure 262 (SEQ 

25 ID NO:367), Figure 264 (SEQ ID NO:369), Figure 266 (SEQ ID NO:371), Figure 268 (SEQ ID NO:373), 
Figure 270 (SEQ ID NO:375), Figure 272 (SEQ ID NO:377), Figure 274 (SEQ ID NO:379), Figure 276 (SEQ 
ID NO:381), Figure 278 (SEQ ID NO:387), Figure 280 (SEQ ID NO:389), Figure 282 (SEQ ID NO:394), 
Figure 284 (SEQ ID NO:399), Figure 286 (SEQ ID NO:401), Figure 288 (SEQ ID NO:403), Figure 290 (SEQ 
ID NO:408), Figure 292 (SEQ ID NO:410), Figure 294 (SEQ ID NO:412), Figure 296 (SEQ ID NO:414), 

30 Figure 298 (SEQ ID NO:416), Figure 300 (SEQ ID NO:418), Figure 302 (SEQ ID NO:420), Figure 304 (SEQ 
ID NO:422) and Figure 306 (SEQ ID NO:424). 

13. Isolated PRO polypeptide having at least 80% sequence identity to the amino acid sequence 
encoded by a nucleic acid molecule deposited under any ATCC accession number shown in Table 2. 

35 

14. A chimeric molecule comprising a polypeptide according to Claim 12 fused to a heterologous 
amino acid sequence. 
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15. The chimeric molecule of Claim 14 wherein said heterologous amino acid sequence is an 
epitope tag sequence. 

16. The chimeric molecule of Claim 14 wherein said heterologous amino acid sequence is a Fc 
region of an immunoglobulin. 

5 

17. An antibody which specifically binds to a PRO polypeptide according to Claim 12. 

18. The antibody of Claim 17 wherein said antibody is a monoclonal antibody. 
10 19. The antibody of Claim 17 wherein said antibody is a humanized antibody. 

20. The antibody of Claim 17 wherein said antibody is an antibody fragment. 

21. An isolated nucleic acid molecule which has at least 80% sequence identity to a nucleic acid 
15 which comprises a nucleotide sequence selected from the group consisting of that shown in Figure 1 (SEQ ID 

NO:l), Figure 3 (SEQ ID NO:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO: 13), Figure 11 (SEQ ID 
NO: 19), Figure 14 (SEQ ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29), Figure 22 (SEQ 
ID NO:32), Figure 24 (SEQ ID NO:35), Figure 26 (SEQ ID NO:40), Figure 29 (SEQ ID NO:46), Figure 31 
(SEQ ID NO:51), Figure 33 (SEQ ID NO:56), Figure 35 (SEQ ID NO:61), Figure 37 (SEQ ID NO:66), Figure 

20 40 (SEQ ID NO:72), Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94), Figure 50 (SEQ ID NO:96), 
Figure 52 (SEQ ID NO:98), Figure 56 (SEQ ID NO: 102), Figure 63 (SEQ ID NO: 112), Figure 65 (SEQ ID 
NO: 114), Figure 67 (SEQ ID NO:116), Figure 69 (SEQ ID NO:i 18), Figure 71 (SEQ ID NO:123), Figure 73 
(SEQ ID NO: 128), Figure 75 (SEQ ID NO: 134), Figure 78 (SEQ ID NO: 137), Figure 82 (SEQ ID NO: 145), 
Figure 84 (SEQ ID NO: 147), Figure 87 (SEQ ID NO: 150), Figure 89 (SEQ ID NO: 152), Figure 92 (SEQ ID 

25 NO: 155), Figure 94 (SEQ ID NO: 157), Figure 96 (SEQ ID NO: 159), Figure 98 (SEQ ID NO: 164), Figure 100 
(SEQ ID NO: 166), Figure 102 (SEQ ID NO: 168), Figure 104 (SEQ ID NO: 170), Figure 108 (SEQ ID 
NO: 174), Figure 110 (SEQ ID NO: 176), Figure 112 (SEQ ID NO: 178), Figure 114 (SEQ ID NO: 180), Figure 
116 (SEQ ID NO: 182), Figure 119 (SEQ ID NO: 188), Figure 121 (SEQ ID NO: 193), Figure 124 (SEQ ID 
NO: 196), Figure 126 (SEQ ID NO: 198), Figure 128 (SEQ ID NO:200), Figure 130 (SEQ ID NO:202), Figure 

30 132 (SEQ ID NO:204), Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID NO:208), Figure 138 (SEQ ID 
NO:210), Figure 140 (SEQ ID NO:212), Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure 
148 (SEQ ID NO:220), Figure 150 (SEQ ID NO:222), Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID 
NO:226), Figure 156 (SEQ ID NO:228), Figure 158 (SEQ ID NO:230), Figure 160 (SEQ ID NO:235), Figure 
162 (SEQ ID NO:240), Figure 164 (SEQ ID NO:245), Figure 166 (SEQ ID NO:247), Figure 168 (SEQ ID 

35 NO:249), Figure 170 (SEQ ID NO:252), Figure 173 (SEQ ID NO:255), Figure 175 (SEQ ID NO:257), Figure 
177 (SEQ ID NO:259), Figure 179 (SEQ ID NO:261), Figure 181 (SEQ ID NO:263), Figure 183 (SEQ ID 
NO:265), Figure 185 (SEQ ID NO:267), Figure 187 (SEQ ID NO:269), Figure 189 (SEQ ID NO:27I), Figure 
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191 (SEQ ID NO:273), Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID 
NO:280), Figure 199 (SEQ ID NO:282), Figure 201 (SEQ ID NO:284), Figure 203 (SEQ ID NO:286), Figure 
205 (SEQ ID NO:288), Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID NO:292), Figure 211 (SEQ ID 
NO:294), Figure 213 (SEQ ID NO:296), Figure 215 (SEQ ID NO:298), Figure 217 (SEQ ID NO:300), Figure 
219 (SEQ ID NO:302), Figure 225 (SEQ ID NO:308) ? Figure 227 (SEQ ID NO:313), Figure 229 (SEQ ID 
5 NO:318), Figure 232 (SEQ ID NO:325), Figure 234 (SEQ ID NO:333), Figure 237 (SEQ ID NO:339), Figure 
239 (SEQ ID NO:344), Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID NO:348), Figure 245 (SEQ ID 
NO:350), Figure 247 (SEQ ID NO:352), Figure 249 (SEQ ID NO:354), Figure 251 (SEQ ID NO:356), Figure 
253 (SEQ ID NO:358), Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID NO:362), Figure 259 (SEQ ID 
NO:364), Figure 261 (SEQ ID NO:366), Figure 263 (SEQ ID NO:368), Figure 265 (SEQ ID NO:370), Figure 

10 267 (SEQ ID NO:372), Figure 269 (SEQ ID NO:374), Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID 
NO:378), Figure 275 (SEQ ID NO:380), Figure 277 (SEQ ID NO:386), Figure 279 (SEQ ID NO:388), Figure 
281 (SEQ ID NO:393), Figure 283 (SEQ ID NO:398), Figure 285 (SEQ ID NO:400), Figure 287 (SEQ ID 
NO:402), Figure 289 (SEQ ID NO:407), Figure 291 (SEQ ID NO:409), Figure 293 (SEQ ID N0:41 1), Figure 
295 (SEQ ID NO:413), Figure 297 (SEQ ID NO:415), Figure 299 (SEQ ID NO:417), Figure 301 (SEQ ID 

15 NO:419), Figure 303 (SEQ ID NO:421) and Figure 305 (SEQ ID NO:423). 

22. An isolated nucleic acid molecule which has at least 80% sequence identity to the full-length 
coding sequence of a nucleotide sequence selected from the group consisting of that shown in Figure 1 (SEQ ID 
NO:l), Figure 3 (SEQ ID NO:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO: 13), Figure 11 (SEQ ID 

20 NO: 19), Figure 14 (SEQ ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29), Figure 22 (SEQ 
ID NO:32), Figure 24 (SEQ ID NO:35), Figure 26 (SEQ ID NO:40), Figure 29 (SEQ ID NO:46), Figure 31 
(SEQ ID NO:51), Figure 33 (SEQ ID NO:56), Figure 35 (SEQ ID NO:61), Figure 37 (SEQ ID NO:66), Figure 
40 (SEQ ID NO:72), Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94), Figure 50 (SEQ ID NO:96), 
Figure 52 (SEQ ID NO:98), Figure 56 (SEQ ID NO: 102), Figure 63 (SEQ ID NO: 112), Figure 65 (SEQ ID 

25 NO:114), Figure 67 (SEQ ID NO:116), Figure 69 (SEQ ID NO:l 18), Figure 71 (SEQ ID NO:123), Figure 73 
(SEQ ID NO: 128), Figure 75 (SEQ ID NO: 134), Figure 78 (SEQ ID NO: 137), Figure 82 (SEQ ID NO: 145), 
Figure 84 (SEQ ID NO: 147), Figure 87 (SEQ ID NO: 150), Figure 89 (SEQ ID NO: 152), Figure 92 (SEQ ID 
NO: 155), Figure 94 (SEQ ID NO: 157), Figure 96 (SEQ ID NO: 159), Figure 98 (SEQ ID NO: 164), Figure 100 
(SEQ ID NO: 166), Figure 102 (SEQ ID NO: 168), Figure 104 (SEQ ID NO: 170), Figure 108 (SEQ ID 

30 NO: 174), Figure 1 10 (SEQ ID NO: 176), Figure 1 12 (SEQ ID NO: 178), Figure 1 14 (SEQ ID NO: 180), Figure 
116 (SEQ ID NO: 182), Figure 119 (SEQ ID NO: 188), Figure 121 (SEQ ID NO: 193), Figure 124 (SEQ ID 
NO: 196), Figure 126 (SEQ ID NO: 198), Figure 128 (SEQ ID NO:200), Figure 130 (SEQ ID NO:202), Figure 
132 (SEQ ID NO:204), Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID NO:208), Figure 138 (SEQ ID 
NO:210), Figure 140 (SEQ ID NO:212), Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure 

35 148 (SEQ ID NO:220), Figure 150 (SEQ ID NO:222), Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID 
NO:226), Figure 156 (SEQ ID NO:228), Figure 158 (SEQ ID NO:230), Figure 160 (SEQ ID NO:235), Figure 
162 (SEQ ID NO:240), Figure 164 (SEQ ID NO:245), Figure 166 (SEQ ID NO:247), Figure 168 (SEQ ID 
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NO:249), Figure 170 (SEQ ID NO:252), Figure 173 (SEQ ID NO:255), Figure 175 (SEQ ID NO:257), Figure 
177 (SEQ ID NO:259), Figure 179 (SEQ ID NO:261), Figure 181 (SEQ ID NO:263), Figure 183 (SEQ ID 
NO:265), Figure 185 (SEQ ID NO:267), Figure 187 (SEQ ID NO:269), Figure 189 (SEQ ID NO:271), Figure 
191 (SEQ ID NO:273), Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID 
NO:280), Figure 199 (SEQ ID NO:282), Figure 201 (SEQ ID NO:284), Figure 203 (SEQ ID NO:286), Figure 
5 205 (SEQ ID NO:288), Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID NO:292), Figure 211 (SEQ ID 
NO:294), Figure 213 (SEQ ID NO:296), Figure 215 (SEQ ID NO:298), Figure 217 (SEQ ID NO:300), Figure 
219 (SEQ ID NO:302), Figure 225 (SEQ ID NO:308), Figure 227 (SEQ ID NO:313), Figure 229 (SEQ ID 
NO:318), Figure 232 (SEQ ID NO:325), Figure 234 (SEQ ID NO:333), Figure 237 (SEQ ID NO:339), Figure 
239 (SEQ ID NO:344), Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID NO:348), Figure 245 (SEQ ID 

10 NO:350), Figure 247 (SEQ ID NO:352), Figure 249 (SEQ ID NO:354), Figure 251 (SEQ ID NO:356), Figure 
253 (SEQ ID NO:358) f Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID NO:362), Figure 259 (SEQ ID 
NO:364) t Figure 261 (SEQ ID NO:366), Figure 263 (SEQ ID NO:368), Figure 265 (SEQ ID NO:370), Figure 
267 (SEQ ID NO:372), Figure 269 (SEQ ID NO:374), Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID 
NO:378). Figure 275 (SEQ ID NO:380), Figure 277 (SEQ ID NO:386), Figure 279 (SEQ ID NO:388), Figure 

15 281 (SEQ ID NO:393), Figure 283 (SEQ ID NO:398), Figure 285 (SEQ ID NO:400), Figure 287 (SEQ ID 
NO:402), Figure 289 (SEQ ID NO:407), Figure 291 (SEQ ID NO:409), Figure 293 (SEQ ID N0:41 1), Figure 
295 (SEQ ID NO:413), Figure 297 (SEQ ID NO:415), Figure 299 (SEQ ID NO:417), Figure 301 (SEQ ID 
NO:419), Figure 303 (SEQ ID NO:421) and Figure 305 (SEQ ID NO:423). 

20 23. An isolated extracellular domain of of PRO polypeptide. 

24. An isolated PRO polypeptide lacking its associated signal peptide. 

25. An isolated polypeptide having at least about 80% amino acid sequence identity to an 
25 extracellular domain of of PRO polypeptide . 

26. An isolated polypeptide having at least about 80% amino acid sequence identity to a PRO 
polypeptide lacking its associated signal peptide. 
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FIGURE 1 

CGGACGCGTGGGTGCGAGGCGAAGGTGACCGGGGACCGAGCATTTCAGATCTGCTCGGTAGA 
CCTGGTGCACCACCACCATGTTGGCTGCAAGGCTGGTGTGTCTCCGGACACTACCTTCTAGG 
GTTTTCCACCCAGCTTTCACCAAGGCCTCCCCTGTTGTGAAGAATTCCATCACGAAGAATCA 
ATGGCTGTTAACACCTAGCAGGGAATATGCCACCAAAACAAGAATTGGGATCCGGCGTGGGA 
GAACTGGCCAAGAACTCAAAGAGGCAGCATTGGAACCATCGATGGAAAAAATATTTAAAATT 
GATCAGATGGGAAGATGGTTTGTTGCTGGAGGGGCTGCTGTTGGTCTTGGAGCATTGTGCTA 
CTATGGCTTGGGACTGTCTAATGAGATTGGAGCTATTGAAAAGGCTGTAATTTGGCCTCAGT 
ATGTCAAGGATAGAATTCATTCCACCTATATGTACTTAGCAGGGAGTATTGGTTTAACAGCT 
TTGTCTGCCATAGCAATCAGCAGAACGCCTGTTCTCATGAACTTCATGATGAGAGGCTCTTG 
GGTGACAATTGGTGTGACCTTTGCAGCCATGGTTGGAGCTGGAATGCTGGTACGATCAATAC 
CATATGACCAGAGCCCAGGCCCAAAGCATCTTGCTTGGTTGCTACATTCTGGTGTGATGGGT 
GCAGTGGTGGCTCCTCTGACAATATTAGGGGGTCCTCTTCTCATCAGAGCTGCATGGTACAC 
AGCTGGCATTGTGGGAGGCCTCTCCACTGTGGCCATGTGTGCGCCCAGTGAAAAGTTTCTGA 
ACATGGGTGCACCCCTGGGAGTGGGCCTGGGTCTCGTCTTTGTGTCCTCATTGGGATCTATG 
TTTCTTCCACCTACCACCGTGGCTGGTGCCACTCTTTACTCAGTGGCAATGTACGGTGGATT 
AGTTCTTTTCAGCATGTTCCTTCTGTATGATACCCAGAAAGTAATCAAGCGTGCAGAAGTAT 
CACCAATGTATGGAGTTCAAAAATATGATCCCATTAACTCGATGCTGAGTATCTACATGGAT 
ACATTAAATATATTTATGCGAGTTGCAACTATGCTGGCAACTGGAGGCAACAGAAAGAAATG 
AAGTGACTCAGCTTCTGGCTTCTCTGCTACATCAAATATCTTGTTTAATGGGGCAGATATGC 
ATTAAATAGTTTGTACAAGCAGCTTTCGTTGAAGTTTAGAAGATAAGAAACATGTCATCATA 
TTTAAATGTTCCGGTAATGTGATGCCTCAGGTCTGCCTTTTTTTCTGGAGAATAAATGCAGT 
AATCCTCTCCCAAATAAGCACACACATTTTCAATTCTCATGTTTGAGTGATTTTAAAATGTT 
TTGGTGAATGTGAAAACTAAAGTTTGTGTCATGAGAATGTAAGTCTTTTTTCTACTTTAAAA 
TTTAGTAGGTTCACTGAGTAACTAAAATTTAGCAAACCTGTGTTTGCATATTTTTTTGGAGT 
GCAGAATATTGTAATTAATGTCATAAGTGATTTGGAGCTTTGGTAAAGGGACCAGAGAGAAG 
GAGTCACCTGCAGTCTTTTGTTTTTTTAAATACTTAGAACTTAGCACTTGTGTTATTGATTA 
GTGAGGAGCCAGTAAGAAACATCTGGGTATTTGGAAACAAGTGGTCATTGTTACATTCATTT 
GCTGAACTTAACAAAACTGTTCATCCTGAAACAGGCACAGGTGATGCATTCTCCTGCTGTTG 
CTTCTCAGTGCTCTCTTTCCAATATAGATGTGGTCATGTTTGACTTGTACAGAATGTTAATC 
ATACAGAGAATCCTTGATGGAATTATATATGTGTGTTTTACTTTTGAATGTTACAAAAGGAA 
ATAACTTTAAAACTATTCTCAAGAGAAAATATTCAAAGCATGAAATATGTTGCTTTTTCCAG 
AATACAAACAGTATACTCATG 
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FIGURE 2 

MLAARLVCLRTLPS RVFHPAFTKAS P WKNS I TKNQWLLT PS RE YATKTR I G I RRGRTGQEL 
KEAALEPSMEKIFKIDQMGRWFVAGGAAVGLGALCYYGLGLSNEIGAIEKAVIWPQYVKDRI 
HSTYMYLAGS IGLTALS AI AI SRTPVLMNFMMRGSWVT I GVTFAAMVGAGMLVRS I PYDQSP 
GPKHLAWLLHSGVMGAWAPLTILGGPLLIRAAW 

GVGLGLVFVSSLGSMFLPPTTVAGATLYSVAMYGGLVLFSMFLLYDTQKVIKRAEVSPMYGV 
QKYDPINSMLS I YMDTLNI FMRVATMLATGGNRKK 
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FIGURE 3 

GAAGGCTGCCTCGCTGGTCCGAATTCGGTGGCGCCACGTCCGCCCGTCTCCGCCTTCTGCAT 
CGCGGCTTCGGCGGCTTCCACCTAGACACCTAACAGTCGCGGAGCCGGCCGCGTCGTGAGGG 
GGTCGGCACGGGGAGTCGGGCGGTCTTGTGCATCTTGGCTACCTGTGGGTCGAAGATGTCGG 
ACATCGGAGACTGGTTCAGGAGCATCCCGGCGATCACGCGCTATTGGTTCGCCGCCACCGTC 
GCCGTGCCCTTGGTCGGCAAACTCGGCCTCATCAGCCCGGCCTACCTCTTCCTCTGGCCCGA 
AGCCTTCCTTTATCGCTTTCAGATTTGGAGGCCAATCACTGCCACCTTTTATTTGCCTGTGG 
GTCCAGGAACTGGATTTCTTTATTTGGTCAATTTATATTTCTTATATCAGTATTCTACGCGA 
CTTGAAACAGGAGCTTTTGATGGGAGGCCAGCAGACTATTTATTCATGCTCCTCTTTAACTG 
GATTTGCATCGTGATTACTGGCTTAGCAATGGATATGCAGTTGCTGATGATTCCTCTGATCA 
TGTCAGTACTTTATGTCTGGGCCCAGCTGAACAGAGACATGATTGTATCATTTTGGTTTGGA 
ACACGATTTAAGGCCTGCTATTTACCCTGGGTTATCCTTGGATTCAACTATATCATCGGAGG 
CTCGGTAATCAATGAGCTTATTGGAAATCTGGTTGGACATCTTTATTTTTTCCTAATGTTCA 
GATACCCAATGGACTTGGGAGGAAGAAATTTTCTATCCACACCTCAGTTTTTGTACCGCTGG 
CTGCCCAGTAGGAGAGGAGGAGTATCAGGATTTGGTGTGCCCCCTGCTAGCATGAGGCGAGC 
TGCTGATCAGAATGGCGGAGGCGGGAGACACAACTGGGGCCAGGGCTTTCGACTTGGAGACC 
AG TGAA GGGGCGGCCTCGGGCAGCCGCTCCTCTCAAGCCACATTTCCTCCCAGTGCTGGGTG 
CACTTAACAACTGCGTTCTGGCTAACACTGTTGGACCTGACCCACACTGAATGTAGTCTTTC 
AGTACGAGACAAAGTTTCTTAAATCCCGAAGAAAAATATAAGTGTTCCACAAGTTTCACGAT 
TCTCATTCAAGTCCTTACTGCTGTGAAGAACAAATACCAACTGTGCAAATTGCAAAACTGAC 
TACATTTTTTGGTGTCTTCTCTTCTCCCCTTTCCGTCTGAATAATGGGTTTTAGCGGGTCCT 
AATCTGCTGGCATTGAGCTGGGGCTGGGTCACCAAACCCTTCCCAAAAGGACCTTATCTCTT 
TCTTGCACACATGCCTCTCTCCCACTTTTCCCAACCCCCACATTTGCAACTAGAAAAAGTTG 
CCCATAAAATTGCTCTGCCCTTGACAGGTTCTGTTATTTATTGACTTTTGCCAAGGCTGGTC 
ACAACAATCATATTCACGTTATTTTCCCCTTTTGGTGGCAGAACTGTTACCAATAGGGGGAG 
AAGACAGCCACGGATGAAGCGTTTCTCAGCTTTTGGAATTGCTTCGACTGACATCCGTTGTT 
AACCGTTTGCCACTCTTCAGATATTTTTTATAAAAAAAGTACCACTGAGTTCATGAGGGCCA 
CAGATTGGTTATTAATGAGATACGAGGGTTGGTGCTGGGTGTTTGTTTCCTGAGCTAAGTGA 
TCAAGACTGTAGTGGAGTTGCAGCTAACATGGGTTAGGTTTAAACCATGGGGGATGCACCCC 
TTTGCGTTTCATATGTAGCCCTACTGGCTTTGTGTAGCTGGAGTAGTTGGGTTGCTTTGTGT 
TAGGAGGATCCAGATCATGTTGGCTACAGGGAGATGCTCTCTTTGAGAGGTCCTGGGCATTG 
ATTCCCATTTCAATCTCATTCTGGATATGTGTTCATTGAGTAAAGGAGGAGAGACCCTCATA 
CGCTATTTAAATGTCACTTTTTTGCCTATCCCCCGTTTTTTGGTCATGTTTCAATTAATTGT 
GAGGAAGGCGCAGCTCCTCTCTGCACGTAGATCATTTTTTAAAGCTAATGTAAGCACATCTA 
AGGGAATAACATGATTTAAGGTTGAAATGGCTTTAGAATCATTTGGGTTTGAGGGTGTGTTA 
TTTTGAGTCATGAATGTACAAGCTCTGTGAATCAGACCAGCTTAAATACCCACACCTTTTTT 
TCGTAGGTGGGCTTTTCCTATCAGAGCTTGGCTCATAACCAAATAAAGTTTTTTGAAGGCCA 
TGGCTTTTCACACAGTTATTTTATTTTATGACGTTATCTGAAAGCAGACTGTTAGGAGCAGT 
ATTGAGTGGCTGTCACACTTTGAGGCAACTAAAAAGGCTTCAAACGTTTTGATCAGTTTCTT 
TTCAGGAAACATTGTGCTCTAACAGTATGACTATTCTTTCCCCCACTCTTAAACAGTGTGAT 
GTGTGTTATCCTAGGAAATGAGAGTTGGCAAACAACTTCTCATTTTGAATAGAGTTTGTGTG 
TACTTCTCCATATTTAATTTATATGATAAAATAGGTGGGGAGAGTCTGAACCTTAACTGTCA 
TGTTTTGTTGTTCATCTGTGGCCACAATAAAGTTTACTTGTAAAATTTTAGAGGCCATTACT 
CCAATTATGTTGCACGTACACTCATTGTACAGGCGTGGAGACTCATTGTATGTATAAGAATA 
TTTCTGACAGTGAGTGACCCGGAGTCTCTGGTGTACCCTCTTACCAGTCAGCTGCCTGCGAG 
CAGTCATTTTTTCCTAAAGGTTTACAAGTATTTAGAACTTTTCAGTTCAGGGCAAAATGTTC 
ATGAAGTTATTCCTCTTAAACATGGTTAGGAAGCTGATGACGTTATTGATTTTGTCTGGATT 
ATGTTTCTGGAATAATTTTACCAAAACAAGCTATTTGAGTTTTGACTTGACAAGGCAAAACA 
TGACAGTGGATTCTCTTTACAAATGGAAAAAAAAAATCCTTATTTTGTATAAAGGACTTCCC 
TTTTTGTAAACTAATCCTTTTTATTGGTAAAAATTGTAAATTAAAATGTGCAACTTG 
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FIGURE 4 

MSDIGDWFRSIPAITRYWFAATVAVPLVGKLGLISPAYLFLWPEAFLYRFQIWRPITATFYF 
PVGPGTGFLYLVNLYFLYQYSTRLETGAFDGRPADYLFMLLFNWI C I VI TGLAMDMQLLM I P 
LIMSVLYWAQLNRDMIVSFWFGTRFKACYLPWVILGFNYIIGGSVINELIGNLVGHLYFFL 
MFRYPMDLGGRNFLSTPQFLYRWLPSRRGGVSGFGVPPASMRRAADQNGGGGRHNWGQGFRLGDQ 
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FIGURE 5 

GGGGCCGCGGTCTAGGGCGGCTACGTGTGTTGCCATAGCGACCATTTTGCATTAACTGGTTG 
GTAGCTTCTATCCTGGGGGCTGAGCGACTGCGGGCCAGCTCTTCCCCTACTCCCTCTCGGCT 
CCTTGTGGCCCAAAGGCCTAACCGGGGTCCGGCGGTCTGGCCTAGGGATCTTCCCCGTTGCC 
CCTTTGGGGCGGGATGGCTGCGGAAGAAGAAGACGAGGTGGAGTGGGTAGTGGAGAGCATCG 
CGGGGTTCCTGCGAGGCCCAGACTGGTCCATCCCCATCTTGGACTTTGTGGAACAGAAATGT 
GAAGTTAACTGCAAAGGAGGGCATGTGATAACTCCAGGAAGCCCAGAGCCGGTGATTTTGGT 
GGCCTGTGTTCCCCTTGTTTTTGATGATGAAGAAGAAAGCAAATTGACCTATACAGAGATTC 
ATCAGGAATACAAAGAACTAGTTGAAAAGCTGTTAGAAGGTTACCTCAAAGAAATTGGAATT 
AATGAAGATCAATTTCAAGAAGCATGCACTTCTCCTCTTGCAAAGACCCATACATCACAGGC 
CATTTTGCAACCTGTGTTGGCAGCAGAAGATTTTACTATCTTTAAAGCAATGATGGTCCAGA 
AAAACATTGAAATGCAGCTGCAAGCCATTCGAATAATTCAAGAGAGAAATGGTGTATTACCT 
GACTGCTTAACCGATGGCTCTGATGTGGTCAGTGACCTTGAACACGAAGAGATGAAAATCCT 
GAGGGAAGTTCTTAGAAAATCAAAAGAGGAATATGACCAGGAAGAAGAAAGGAAGAGGAAAA 
AACAGTTATCAGAGGCTAAAACAGAAGAGCCCACAGTGCATTCCAGTGAAGCTGCAATAATG 
AATAATTCCCAAGGGGATGGTGAACATTTTGCACACCCACCCTCAGAAGTTAAAATGCATTT 
TGCTAATCAGTCAATAGAACCTTTGGGAAGAAAAGTGGAAAGGTCTGAAACTTCCTCCCTCC 
CACAAAAAGGCCTGAAGATTGCTGGCTTAGAGCATGCGAGCATTGAAGGACCAATAGCAAAC 
TTATCAGTACTTGGAACAGAAGAACTTCGGCAACGAGAACACTATCTCAAGCAGAAGAGAGA 
TAAGTTGATGTCCATGAGAAAGGATATGAGGACTAAACAGATACAAAATATGGAGCAGAAAG 
GAAAACCCACTGGGGAGGTAGAGGAAATGACAGAGAAACCAGAAATGACAGCAGAGGAGAAG 
CAAACATTACTAAAGAGGAGATTGCTTGCAGAGAAACTCAAAGAAGAAGTTATTAATAAGTA 
ATAATTAAGAACAATTTAACAAAATGGAAGTTCAAATTGTCTTAAAAATAAATTATTTAGTC 
CTTACACTG 
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FIGURE 6 

MAAEEEDEVEWVVESIAGFLRGPDWSIPILDFVEQKCEVNCKGGHVITPGSPEPVILVACVP 
LVFDDEEESKLTYTE IHQEYKELVEKLLEGYLKE IG INEDQFQEACTSPLAKTHTSQAILQP 
VLAAEDFTIFKAMMVQKNIEMQLQAIRIIQERNGVLPDCLTDGSDVVSDLEHEEMKILREVL 
RKSKEEYDQEEERKRKKQLSEAKTEEPTVHSSEAAIMNNSQGDGEHFAHPPSEVKMHFANQS 
IEPLGRKVERSETSSLPQKGLKIPGLEHASIEGPIANLSVLGTEELRQREHYLKQKRDKLMS 
MRKDMRTKQIQNMEQKGKPTGEVEEMTEKPEMTAEEKQTLLKRRLLAEKLKEEVINK 



WO 99/63088 ^^yO PCT/US99/12252 

FIGURE 7 

GGGCACAGCACATGTGAAGTTTTTGATGATGAAGAAGAAAGCAAATTGACCTATACAGAGAT 
TCATCAGGAATACAAAGAACTAGTTGAAAAGCTGTTAGAAGGTTACCTCAAAGAAATTGGAA 
TTAATGAAGATCAATTTCAAGAAGCATGCACTTCTCCTCTTGCAAAGACCCATACATCACAG 
GCCATTTTTGCAACCTGTGTTGGCAGCAGAAGATTTTACTATCTTTAAAGCAATGATGGTCC 
AGAAAAAC ATTGAAATGC AGCTGC AAGC CATT CGAATAATT CAAGAGAGAAATGGTGTATTA 
CCTGACTGCTTAACCGATGGCTCTGATGTGGTCAGTGACCTTGAACACGAAGAGATGAAAAT 
CCTGAGGGAAGTTCTTAGAAAATCAAAAGAGGAATATGACCAGGAA 
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FIGURE 8 

GCGTGGTTTTTGTTCTGCAATAGGCGGCTTAGAGGGAGGGGCTTTTTCGCCTATACCTACTG 
TAGCTTCTCCACGTATGGACCCTAAAGGCTACTGCTGCTACTACGGGGCTAGACAGTTACTG 
TCTCAGCTCTAGGATGTGCGTTCTTCCACTAGAAGCTCTTCTGAGGGAGGTAATTAAAAAAC 
AGTGGAATGGAAAAACAGTGCTGTAGTCATCCTGTAATATGCTCCTTGTCAACAATGTATAC 
ATTCCTGCTAGGTGCCATATTCATTGCTTTAAGCTCAAGTCGCATCTTACTAGTGAAGTATT 
CTGCCAATGAAGAAAACAAGTATGATTATCTTCCAACTACTGTGAATGTGTGCTCAGAACTG 
GTGAAGCTAGTTTTCTGTGTGCTTGTGTCATTCTGTGTTATAAAGAAAGATCATCAAAGTAG 
AAATTTGAAATATGCTTCCTGGAAGGAATTCTCTGATTTCATGAAGTGGTCCATTCCTGCCT 
TTCTTTATTTCCTGGATAACTTGATTGTCTTCTATGTCCTGTCCTATCTTCAACCAGCCATG 
GCTGTTATCTTCTCAAATTTTAGCATTATAACAACAGCTCTTCTATTCAGGATAGTGCTGAA 
GAGGCGTCTAAACTGGATCCAGTGGGCTTCCCTCCTGACTTTATTTTTGTCTATTGTGGCCT 
TGACTGCCGGGACTAAAACTTTACAGCACAACTTGGCAGGACGTGGATTTCATCACGATGCC 
TTTTTCAGCCCTTCCAATTCCTGCCTTCTTTTCAGAAGTGAGTGTCCCAGAAAAGACAATTG 
TACAGCAAAGGAATGGACTTTTCCTGAAGCTAAATGGAACACCACAGCCAGAGTTTTCAGTC 
ACATCCGTCTTGGCATGGGCCATGTTCTTATTATAGTCCAGTGTTTTATTTCTTCAATGGCT 
AATATGTATAATGAAAAGATACTGAAGGAGGGGAACCAGCTCACTGAAAGCATCTTCATACA 
GAACAGCAAACTCTATTTCTTTGGCATTCTGTTTAATGGGCTGACTCTGGGCCTTCAGAGGA 
GTAACCGTGATCAGATTAAGAACTGTGGATTTTTTTATGGCCACAGTGCATTTTCAGTAGCC 
CTTATTTTTGTAACTGCATTCCAGGGCCTTTCAGTGGCTTTCATTCTGAAGTTCCTGGATAA 
CATGTTCCATGTCTTGATGGCCCAGGTTACCACTGTCATTATCACAACAGTGTCTGTCCTGG 
TCTTTGACTTCAGGCCCTCCCTGGAATTTTTCTTGGAAGCCCCATCAGTCCTTCTCTCTATA 
TTTATTTATAATGCCAGCAAGCCTCAAGTTCCGGAATACGCACCTAGGCAAGAAAGGATCCG 
AGATCTAAGTGGCAATCTTTGGGAGCGTTCCAGTGGGGATGGAGAAGAACTAGAAAGACTTA 
CCAAACCCAAGAGTGATGAGTCAGATGAAGATACTTTCTAACTGGTACCCACATAGTTTGCA 
GCTCTCTTGAACCTTATTTTCACATTTTCAGTGTTTGTAATATTTATCTTTTCACTTTGATA 
AACCAGAAATGTTTCTAAATCCTAATATTCTTTGCATATATCTAGCTACTCCCTAAATGGTT 
CCATCCAAGGCTTAGAGTACCCAAAGGCTAAGAAATTCTAAAGAACTGATACAGGAGTAACA 
ATATGAAGAATTCATTAATATCTCAGTACTTGATAAATCAGAAAGTTATATGTGCAGATTAT 
TTTCCTTGGCCTTCAAGCTTCCAAAAAACTTGTAATAATCATGTTAGCTATAGCTTGTATAT 
ACACATAGAGATCAATTTGCCAAATATTCACAATCATGTAGTTCTAGTTTACATGCCAAAGT 
CTTCCCTTTTTAACATTATAAAAGCTAGGTTGTCTCTTGAATTTTGAGGCCCTAGAGATAGT 
CATTTTGCAAGTAAAGAGCAACGGGACCCTTTCTAAAAACGTTGGTTGAAGGACCTAAATAC 
CTGGCCATACCATAGATTTGGGATGATGTAGTCTGTGCTAAATATTTTGCTGAAGAAGCAGT 
TTCTCAGACACAACATCTCAGAATTTTAATTTTTAGAAATTCATGGGAAATTGGATTTTTGT 
AATAATCTTTTGATGTTTTAAACATTGGTTCCCTAGTCACCATAGTTACCACTTGTATTTTA 
AGTCATTTAAACAAGCCACGGTGGGGCTTTTTTCTCCTCAGTTTGAGGAGAAAAATCTTGAT 
GTCATTACTCCTGAATTATTACATTTTGGAGAATAAGAGGGCATTTTATTTTATTAGTTACT 
AATTCAAGCTGTGACTATTGTATATCTTTCCAAGAGTTGAAATGCTGGCTTCAGAATCATAC 
CAGATTGTCAGTGAAGCTGATGCCTAGGAACTTTTAAAGGGATCCTTTCAAAAGGATCACTT 
AGCAAACACATGTTGACTTTTAACTGATGTATGAATATTAATACTCTAAAAATAGAAAGACC 
AGTAATATATAAGTCACTTTACAGTGCTACTTCACACTTAAAAGTGCATGGTATTTTTCATG 
GTATTTTGCATGCAGCCAGTTAACTCTCGTAGATAGAGAAGTCAGGTGATAGATGATATTAA 
AAATTAGCAAACAAAAGTGACTTGCTCAGGGTCATGCAGCTGGGTGATGATAGAAGAGTGGG 
CTTTAACTGGCAGGCCTGTATGTTTACAGACTACCATACTGTAAATATGAGCTTTATGGTGT 
CATTCTCAGAAACTTATACATTTCTGCTCTCCTTTCTCCTAAGTTTCATGCAGATGAATATA 
AGGTAATATACTATTATATAATTCATTTGTGATATCCACAATAATATGACTGGCAAGAATTG 
GTGGAAATTTGTAATTAAAATAATTATTAAACCT 
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FIGURE 9 

MEKQCCSHPVICSLSTMYTFLLGAIFIALSSSRILLVKYSANEENKYDYLPTTVNVCSELVK 
LVFCVLVSFCVIKKDHQSRNLKYASWKEFSDFMKWS I PAFLYFLDNLI VFYVLSYLQPAMAV 
IFSKTFS I ITTALLFRI VLKRRLNWIQWASLLTLFLS I VALTAGTKTLQHNLAGRGFHHDAFF 
SPSNSCLLFRSECPRKDNCTAKEOTFPEAKWNTTARVFSHIRLGMGHVLIIVQCFISSMANI 
YNEKILKEGNQLTESIFIQNSKLYFFGILFNGLTLGLQRSNRDQIKNCGFFYGHSAFSVALI 
FVTAFQGLSVAFILKFLDNMFHVLMAQVTTVI ITTVSVLVFDFRPSLEFFLEAPSVLLSIFI 
YNASKPQVPEYAPRQERIRDLSGNLWERSSGDGEELERLTKPKSDESDEDTF 
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FIGURE 10 

CGTGCCTGCGCAATGGGTGTCGGGTCCGCTTTTTCCCAATCCGGACGTAATCGTGGTTTTTG 
TTCTGCAATAGGCGGCTTAGAGGGAGGGGCTTTTTCGCCTATACCTACTGTAGCTTCTCCAC 
GTATGGACCCTAAAGGCTACTGCTGCTACTACGGGGCTAGACAGTTACTGTCTCAGCTCTAG 
GATGTGCGTTCTTCCACTAGAAGCTCTTCTGAGGGAGGTAATTAAAAAACAGTGGAATGGAA 
AAACAGTGCTGTAGTCATCCTGTAATATGCTCCTTGTCAACAATGTATACATTCCTGCTAGG 
TGCCATATTCATTGCTTTAAGCTCAAGTCGCATCTTACTAGTGAAGTATTCTGCCAATGAAG 
AAAACAAGTATGATTATCTTCCAACTACTGTGAATGTGTGCTCAGAACTGGTGAAGCTAGTT 
TTCTGTGTGCTTGTGTCATTCTGTGTTATAT^AGAAAGATCATCAAAGTAGAAATTTGAAATA 
TGCTTCCTGGAAGGAATTCTCTGATTTCATGAAGTGGTCCATTCCTGCCTTTCTTTATTTCC 
TGGATAACTTGATTGTCTTCTATGTCCTGTCCTATCTTCAACCAGCCATGGCTGTTATCTTC 
TCAAATTTTAGCATTATAACAACAGCTCTTCTATTCAGGATAGTGCTGAAGAGGCGTCTAAA 
CTGGATCCAGTGGGCTTCCCTCCTGACTTTATTTTTGTCTATTGTGGCCTTGACTGCCGGGA 
CTAAAACTTTA 
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FIGURE 11 

CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGGCCGGCTTGGCTAGCGCGCGGCGGCC 
GTGGCTAAGGCTGCTACGAAGCGAGCTTGGGAGGAGCAGCGGCCTGCGGGGCAGAGGAGCAT 
CCCGTCTACCAGGTCCCAAGCGGCGTGGCCCGCGGGTCATGGCCAAAGGAGAAGGCGCCGAG 
AGCGGCTCCGCGGCGGGGCTGCTACCCACCAGCATCCTCCAAAGCACTGAACGCCCGGCCCA 
GGTGAAGAAAGAACCGAAAAAGAAGAAACAACAGTTGTCTGTTTGCAACAAGCTTTGCTATG 
CACTTGGGGGAGCCCCCTACCAGGTGACGGGCTGTGCCCTGGGTTTCTTCCTTCAGATCTAC 
CTATTGGATGTGGCTCAGGTGGGCCCTTTCTCTGCCTCCATCATCCTGTTTGTGGGCCGAGC 
CTGGGATGCCATCACAGACCCCCTGGTGGGCCTCTGCATCAGCAAATCCCCCTGGACCTGCC 
TGGGTCGCCTTATGCCCTGGATCATCTTCTCCACGCCCCTGGCCGTCATTGCCTACTTCCTC 
ATCTGGTTCGTGCCCGACTTCCCACACGGCCAGACCTATTGGTACCTGCTTTTCTATTGCCT 
CTTTGAAACAATGGTCACGTGTTTCCATGTTCCCTACTCGGCTCTCACCATGTTCATCAGCA 
ACCGAGCAGACTGAGCGGGATTCTGCCACCGCCTATCGGATGACTGTGGAAGTGCTGGGCAC 
AGTGCTGGGCACGGCGATCCAGGGACAAATCGTGGGCCAAGCAGACACGCCTTGTTTCCAGG 
ACTTCAATAGCTCTACAGTAGCTTCACAAAGTGCCAACCATACACATGGCACCACTTCACAC 
AGGGAAACGCAAAAGGCATACCTGCTGGCAGCGGGGGTCATTGTCTGTATCTATATAATCTG 
TGCTGTCATCCTGATCCTGGGCGTGCGGGAGCAGAGAGAACCCTATGAAGCCCAGCAGTCTG 
AGCCAATCGCCTACTTCCGGGGCCTACGGCTGGTCATGAGCCACGGCCCATACATCAAACTT 
ATTACTGGCTTCCTCTTCACCTCCTTGGCTTTCATGCTGGTGGAGGGGAACTTTGTCTTGTT 
TTGCACCTACACCTTGGGCTTCCGCAATGAATTCCAGAATCTACTCCTGGCCATCATGCTCT 
CGGCCACTTTAACCATTCCCATCTGGCAGTGGTTCTTGACCCGGTTTGGCAAGAAGACAGCT 
GTATATGTTGGGATCTCATCAGCAGTGCCATTTCTCATCTTGGTGGCCCTCATGGAGAGTAA 
CCTCATCATTACATATGCGGTAGCTGTGGCAGCTGGCATCAGTGTGGCAGCTGCCTTCTTAC 
TACCCTGGTCCATGCTGCCTGATGTCATTGACGACTTCCATCTGAAGCAGCCCCACTTCCAT 
GGAACCGAGCCCATCTTCTTCTCCTTCTATGTCTTCTTCACCAAGTTTGCCTCTGGAGTGTC 
ACTGGGCATTTCTACCCTCAGTCTGGACTTTGCAGGGTACCAGACCCGTGGCTGCTCGCAGC 
CGGAACGTGTCAAGTTTACACTGAACATGCTCGTGACCATGGCTCCCATAGTTCTCATCCTG 
CTGGGCCTGCTGCTCTTCAAAATGTACCCCATTGATGAGGAGAGGCGGCGGCAGAATAAGAA 
GGCCCTGCAGGCACTGAGGGACGAGGCCAGCAGCTCTGGCTGCTCAGAAACAGACTCCACAG 
AGCTGGCTAGCATCCTCTAGGGCCCGCCACGTTGCCCGAAGCCACCATGCAGAAGGCCACAG 
AAGGGATCAGGACCTGTCTGCCGGCTTGCTGAGCAGCTGGACTGCAGGTGCTAGGAAGGGAA 
CTGAAGACTCAAGGAGGTGGCCCAGGACACTTGCTGTGCTCACTGTGGGGCCGGCTGCTCTG 
TGGCCTCCTGCCTCCCCTCTGCCTGCCTGTGGGGCCAAGCCCTGGGGCTGCCACTGTGAATA 

TAATTAATAACTTAATGACTGTGTACATAGCAATGTGTGTGTATGTATATGTCTGTGAGCTA 
TTAATGTTATTAATTTTCATAAAAGCTGGAAAGC 
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FIGURE 12 

MWLRWALSLPPSSCLWAEPGMPSQTPWWASASANPPGPAWVALCPGSSSPRPWPSLPTSSSG 
SCPTSHTARPIGTCFSIASLKQWSRVSMFPTRLSPCSSATEQTERDSATAYRMTVEVLGTVL 
GTAIQGQIVGQADTPCFQDFNSSTVASQSANHTHGTTSHRETQKAYLLAAGVIVCIYIICAV 
ILILGWEQREPYEAQQSEPIAYFRGLRLVMSHGPYIKLITGFLFTSLAFMLVEGNFVLFCT 
YTLGFRNE FQNLLLAI MLS ATLT I P I WQWFLTRFGKKTAVYVG I S SAVPFL I L VALMESNL I 
ITYAVAVAAGISVAAAFLLPWSMLPDVIDDFHLKQPHFHGTEPIFFSFYVFFTKFASGVSLG 
ISTLSLDFAGYQTRGCSQPERVKFTLNMLVTMAPIVLILLGLLLFKMYPIDEERRRQNKKAL 
QALRDEASSSGCSETDSTELASIL 



WO 99/63088 /<^^\ O PCT/US99/12252 

FIGURE 13 

GGGAAACGCAi\AAGGCATACCTGCTGGCAGCGGGGGTCATTGTCTGTATCTATATAATCTGT 
GCTGTCATCCTGATCCTGGGCGTGCGGGAGCAGAGAGAACCCTATGAAGCCCAGCAGTCTGA 
GCCAATCGCCTACTTCCGGGGCCTACGGCTGGTCATGAGCCACGGCCCATACATCAAACTTA 
TTACTGGCTTCCTCTTCACCTCCTTGGCTTTCATGCTGGTGGAGGGGAACTTTGTCTTGTTT 
TGCACCTACACCTTGGGCTTCCGCAATGAATTCCAGAATCTACTCCTGGCCATCATGCTCTC 
GGCCACTTTAACCATTCCCATCTGGCAGTGGTTCTTGACCCGGTTTGGCAAGAAGACAGCTG 
TATATGTTGGGATCTCATCAGCAGTGCCATTTCTCATCTTGGTGGCCCTCATGGAGAGTAAC 
CTCATCATTACATATGCGGTAGCTGTGGCAGCTGGCATCAGTGTGGCAGCTGCCTTCTTACT 
ACCCTGGTCCATGCTGCCTGATGTCATTGACGACTTCCATCTGAAGCAGCCCCACTTCCATG 
GAACCGAGCCCAT 
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FIfilJRE 14 

GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGT 
ATGAGCAGGTCTGAAGACTAACATTTTGTGAAGTTGTAAAACAGAAAACCTGTTAGAAATGT 
GGTGGTTTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCT 
TTCATATTTTCATACATTACTGCAGTAACACTCCACCATATAGACCCGGCTTTACCTTATAT 
CAGTGACACTGGTACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGG 
CAGTTTTATGCATTGCTACCATTTATGTTCGTTATAAGCAAGTTCATGCTCTGAGTCCTGAA 
GAGAACGTTATCATCAAATTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGG 
ACTTTCTATTGTGGCAAACTTCCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTG 
TGCTTACCTTTGGTATGGGCTCATTATATATGTTTGTTCAGACCATCCTTTCCTACCAAATG 
CAGCCCAAAATCCATGGCAAACAAGTCTTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGG 
AGTAAGTGCACTTAGCATGCTGACTTGCTCATCAGTTTTGCACAGTGGCAATTTTGGGACTG 
ATTTAGAACAGAAACTCCATTGGAACCCCGAGGACAAAGGTTATGTGCTTCACATGATCACT 
ACTGCAGCAGAATGGTCTATGTCATTTTCCTTCTTTGGTTTTTTCCTGACTTACATTCGTGA 
TTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACATGGATTAACCCTCTATGACACTG 
CACCTTGCCCTATTAACAATGAACGAACACGGCTACTTTCCAGAGATATTTGATGAAAGGAT 
AAAATATTTCTGTAATGATTATGATTCTCAGGGATTGGGGAAAGGTTCACAGAAGTTGCTTA 
TTCTTCTCTGAAATTTTCAACCACTTAATCAAGGCTGACAGTAACACTGATGAATGCTGATA 
ATCAGGAAACATGAAAGAAGCCATTTGATAGATTATTCTAAAGGATATCATCAAGAAGACTA 
TTAAAAACACCTATGCCTATACTTTTTTATCTCAGAAAATAAAGTCAAAAGACTATG 
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FIGURE 15 

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNI 
AAVLCIATIYVRYKQVHALSPEENVIIKLNKAGLVLGILSCLGLSIVANFQKTTLFAAHVSG 
AVLTFGMGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFG 
TDLEQKLHWNPEDKGYVLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVEANLHGLTLYD 
TAPCP INNERTRLLSRD I 
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FIGURE 16 

CGGACGCTTGGGCNGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGTGCCTGATGCCGAGT 
TCCGTCTCTCGGGTCTTTTCCTGGTCCCAGGCAAAGCGGAGCGGAGATCCTCAAACGGCCTA 
GTGCTTCGCGCTTCCGGAGAAAATCAGCGGTCTAATTAATTCCTCTGGTTTGTTGAAGCAGT 
TACCAAGAATCTTCAACCCTTTCCCACAAAAGCTAATTGAGTACACGTTCCTGTTGAGTACA 
CGTTCCTGTTGATTTACAA/^AGGTGCAGGTATGAGCAGGTCTGAAGACTAACATTTTGTGAA 
GTTGTAAAACAGAAAACCTGTTAGAAATGTGGTGGTTTCAGCAAGGCCTCAGTTTCCTTCCT 
TCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTTTCATACATTACTGCAGTAACACT 
CCACCATATAGACCCGGCTTTACCTTATATCAGTGACACTGGTACAGTANC 
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FIGURE 17 

CCCACGCGTCCGCCCGCCGCTGCGTCCCGGAGTGCAAGTGAGCTTCTCGGCTGCCCCGCGGG 
CCGGGGTGCGGAGCCGACATGCGCCCGCTTCTCGGCCTCCTTCTGGTCTTCGCCGGCTGCAC 
CTTCGCCTTGTACTTGCTGTCGACGCGACTGCCCCGCGGGCGGAGACTGGGCTCCACCGAGG 
AGGCTGGAGGCAGGTCGCTGTGGTTCCCCTCCGACCTGGCAGAGCTGCGGGAGCTCTCTGAG 
GTCCTTCGAGAGTACCGGAAGGAGCACCAGGCCTACGTGTTCCTGCTCTTCTGCGGCGCCTA 
CCTCTACAAACAGGGCTTTGCCATCCCCGGCTCCAGCTTCCTGAATGTTTTAGCTGGTGCCT 
TGTTTGGGCCATGGCTGGGGCTTCTGCTGTGCTGTGTGTTGACCTCGGTGGGTGCCACATGC 
TGCTACCTGCTCTCCAGTATTTTTGGCAAACAGTTGGTGGTGTCCTACTTTCCTGATAAAGT 
GGCCCTGCTGCAGAGAAAGGTGGAGGAGAACAGAAACAGCTTGTTTTTTTTCTTATTGTTTT 
TGAGACTTTTCCCCATGACACCAAACTGGTTCTTGAACCTCTCGGCCCCAATTCTGAACATT 
CCCATCGTGCAGTTCTTCTTCTCAGTTCTTATCGGTTTGATCCCATATAATTTCATCTGTGT 
GCAGACAGGGTCCATCCTGTCAACCCTAACCTCTCTGGATGCTCTTTTCTCCTGGGACACTG 
TCTTTAAGCTGTTGGCGATTGCCATGGTGGCATTAATTCCTGGAACCCTCATTAAAAAATTT 
AGTCAGAAACATCTGCAATTGAATGAAACAAGTACTGCTAATCATATACACAGTAGAAAAGA 
CACATGATCTGGATTTTCTGTTTGCCACATCCCTGGACTCAGTTGCTTATTTGTGTAATGGA 
TGTGGTCCTCTAAAGCCCCTCATTGTTTTTGATTGCCTTCTATAGGTGATGTGGACACTGTG 
CATCAATGTGCAGTGTCTTTTCAGAAAGGACACTCTGCTCTTGAAGGTGTATTACATCAGGT 
TTTCAAACCAGCCCTGGTGTAGCAGACACTGCAACAGATGCCTCCTAGAAAATGCTGTTTGT 
GGCCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCCGGTGATTC 
ACAAGGTCAGGAGTTCAAGACCAGCCTGGCCAAGATGGTGAAATCCTGTCTCTAATAAAAAT 
ACAAAAATTAGCCAGGCGTGGTGGCAGGCACCTGTAATCCCAGCTACTCGGGAGGCTGAGGC 
AGGAGAATTGCTTGAACCAAGGTGGCAGAGGTTGCAGTAAGCCAAGATCACACCACTGCACT 
CCAGCCTGGGTGATAGAGTGAGACACTGTCTTGAC 
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FIGURE 18 

MRPLLGLLLVFAGCTFALYLLSTRLPRGRRLGSTEEAGGRSLWFPSDLAELRELSEVLREYR 
KEHQAYVFLLFCGAYLYKQGFAIPGSSFLNVLAGALFGPWLGLLLCCVLTSVGATCCYLLSS 
IFGKQLWSYFPDKVALLQRKVEENRNSLFFFLLFLRLFPMTPNWFLNLSAPILNIPIVQFF 
FSVL IGLI PYNF I CVQTGS I LSTLTSLDALFSWDTVFKLLAI AMVALI PGTLI KKFSQKHLQ 
LNETS TANH I HSRKDT 
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FIGURE 19 

CCGAGGCGGGAGGAGCCCGAGGGGGCGCGAGCCCCGCATGAATCATTGTAGTCAATCATTTT 
CCAGTTCTCAGCCGCTCAGTTGTGATCAAGGGACACGTGGTTT.CCGAACTGCCAGCTCAGAA 
TAGGAAAATAACTTGGGATTTTATATTGGAAGACATGGATCTTGCTGCCAACGAGATCAGCA 
TTTATGACAAACTTTCAGAGACTGTTGATTTGGTGAGACAGACCGGCCATCAGTGTGGCATG 
TCAGAGAAGGCAATTGAAAAATTTATCAGACAGCTGCTGGAAAAGAATGAACCTCAGAGACC 
CCCCCCGCAGTATCCTCTCCTTATAGTTGTGTATAAGGTTCTCGCAACCTTGGGATTAATCT 
TGCTCACTGCCTACTTTGTGATTCAACCTTTCAGCCCATTAGCACCTGAGCCAGTGCTTTCT 
GGAGCTCACACCTGGCGCTCACTCATCCATCACATTAGGCTGATGTCCTTGCCCATTGCCAA 
GAAGTACATGTCAGAAAATAAGGGAGTTCCTCTGCATGGGGGTGATGAAGACAGACCCTTTC 
CAGACTTTGACCCCTGGTGGACAAACGACTGTGAGCAGAATGAGTCAGAGCCCATTCCTGCC 
AACTGCACTGGCTGTGCCCAGAAACACCTGAAGGTGATGCTCCTGGAAGACGCCCCAAGGAA 
ATTTGAGAGGCTCCATCCACTGGTGATCAAGACGGGAAAGCCCCTGTTGGAGGAAGAGATTC 
AGCATTTTTTGTGCCAGTACCCTGAGGCGACAGAAGGCTTCTCTGAAGGGTTTTTCGCCAAG 
TGGTGGCGCTGCTTTCCTGAGCGGTGGTTCCCATTTCCTTATCCATGGAGGAGACCTCTGAA 
CAGATCACAAATGTTACGTGAGCTTTTTCCTGTTTTCACTCACCTGCCATTTCCAAAAGATG 
CCTCTTTAAACAAGTGCTCCTTTCTTCACCCAGAACCTGTTGTGGGGAGTAAGATGCATAAG 
ATGCCTGACCTATTTATCATTGGCAGCGGTGAGGCCATGTTGCAGCTCATCCCTCCCTTCCA 
GTGCCGAAGACATTGTCAGTCTGTGGCCATGCCAATAGAGCCAGGGGATATCGGCTATGTCG 
ACACCACCCACTGGAAGGTCTACGTTATAGCCAGAGGGGTCCAGCCTTTGGTCATCTGCGAT 
GGAACCGCTTTCTCAGAACTGTAGGAAATAGAACTGTGCACAGGAACAGCTTCCAGAGCCGA 
AAACCAGGTTGAAAGGGGAAAAATAAAAACAAAAACGATGAAACTGCAAAAA 
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FIGURE 20 

MDLAANEISIYDKLSETVDLWQTGHQCGMSEKAIEKFIRQLLEKNEPQRPPPQYPLLIVVY 
KVLATLGLILLTAYFVIQPFSPLAPEPVLSGAHTWRSLIHHIRLMSLPIAKKYMSENKGVPL 
HGGDEDRPFPDFDPWWTNDCEQNESEPIPANCTGCAQKHLKVMLLEDAPRKFERLHPLVIKT 
GKPLLEEEIQHFLCQYPEATEGFSEGFFAKWWRCFPERWFPFPYPWRRPLNRSQMLRELFPV 
FTHLPFPKDASLNKCSFLHPEPWGSKMHKMPDLFIIGSGEAMLQLIPPFQCRRHCQSVAMP 
I EPGDI GYVDTTHWKVYVI ARGVQPLVI CDGTAFSEL 
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FIGURE 21 

CCACGGTGTCCGTTCTTCGCCCGGCGGCAGCTGTCCCCGAGGCGGGAGGAGCCCGAGGGGCG 
CGAGCCCCGCATGAATCATTGTAGTCAATCATTTTCCAGTTCTCAGCCGTTCAGTTGTGATC 
AAGGGACACGTGGTTTCCGAACTGCCAGCTCAGAATAGGAAAATAACTTGGGATTTTATATT 
GGAAGACATGGATCTTGCTGCCAACGAGATCAGCATTTATGACAAACTTTCAGAGACTGTTG 
ATTTGGTGAGACAGACCGGCCATCAGTGTGGCATGTCAGAGAAGGCAATTGAAAAATTTATC 
AGACAGCTGCTGGAAAAGAATGAACCTCAGAGACCCCCCCCGCAGTATCCTCTCCTTATAGT 
TGTGTATAAGGTTCTCGCAACCTTGGGATTAATCTTGCTCACTGCCTACTTTGTGATTCAAC 
CTTTCAGCCCATTAGCACCTGAGCCAGTGCTTTGTGGAGCTCAC 
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FTGIIRE 22 

CCCACGCGTCCGCCCACGCGTCCGGCTGAACACCTCTTCTTTGGAGTCAGCCACTGATGAGG 
CAGGGTCCCCACTTGCAGCTGCAGCAGCTGCAGCAGCTGCAGAGCGCTGCTCCTGGCTGGTG 
CCACTGGTGCGCACGCTGCTAGACCGTGCCTATGAGCCGCTGGGGCTGCAGTGGGGACTGCC 
CTCCCTGCCACCCACCAATGGCAGCCCCACCTTCTTTGAAGACTTCCAGGCTTTTTGTGCCA 
CACCCGAATGGCGCCACTTCATCGACAAACAGGTACAGCCAACCATGTCCCAGTTCGAAATG 
GACACGTATGCTAAGAGCCACGACCTTATGTCAGGTTTCTGGAATGCCTGCTATGACATGCT 
TATGAGCAGTGGGCAGCGGCGCCAGTGGGAGCGCGCCCAGAGTCGTCGGGCCTTCCAGGAGC 
TGGTGCTGGAACCTGCGCAGAGGCGGGCGCGCCTGGAGGGGCTACGCTACACGGCAGTGCTG 
AAGCAGCAGGCAACGCAGCACTCCATGGCCCTGCTGCACTGGGGGGCGCTGTGGCGCCAGCT 
CGCCAGCCCATGTGGGGCCTGGGCGCTGAGGGACACTCCCATCCCCCGCTGGAAACTGTCCA 
GCGCCGAGACATATTCACGCATGCGTCTGAAGCTGGTGCCCAACCATCACTTCGACCCTCAC 
CTGGAAGCCAGCGCTCTCCGAGACAATCTGGGTGAGGTTCCCCTGACACCCACCGAGGAGGC 
CTCACTGCCTCTGGCAGTGACCAAAGAGGCCAAAGTGAGCACCCCACCCGAGTTGCTGCAGG 
AGGACCAGCTCGGCGAGGACGAGCTGGCTGAGCTGGAGACCCCGATGGAGGCAGCAGAACTG 
GATGAGCAGCGTGAGAAGCTGGTGCTGTCGGCCGAGTGCCAGCTGGTGACGGTAGTGGCCGT 
GGTCCCAGGGCTGCTGGAGGTCACCACACAGAATGTATACTTCTACGATGGCAGCACTGAGC 
GCGTGGAAACCGAGGAGGGCATCGGCTATGATTTCCGGCGCCCACTGGCCCAGCTGCGTGAG 
GTCCACCTGCGGCGTTTCAACCTGCGCCGTTCAGCACTTGAGCTCTTCTTTATCGATCAGGC 
CAACTACTTCCTCAACTTCCCATGCAAGGTGGGCACGACCCCAGTCTCATCTCCTAGCCAGA 
CTCCGAGACCCCAGCCTGGCCCCATCCCACCCCATACCCAGGTACGGAACCAGGTGTACTCG 
TGGCTCCTGCGCCTACGGCCCCCCTCTCAAGGCTACCTAAGCAGCCGCTCCCCCCAGGAGAT 
GCTGCGTGCCTCAGGCCTTACCCAGAAATGGGTACAGCGTGAGATATCCAACTTCGAGTACT 
TGATGCAACTCAACACCATTGCGGGGCGGACCTACAATGACCTGTCTCAGTACCCTGTGTTC 
CCCTGGGTCCTGCAGGACTACGTGTCCCCAACCCTGGACCTCAGCAACCCAGCCGTCTTCCG 
GGACCTGTCTAAGCCCATCGGTGTGGTGAACCCCAAGCATGCCCAGCTCGTGAGGGAGAAGT 
ATGAAAGCTTTGAGGACCCAGCAGGGACCATTGACAAGTTCCACTATGGCACCCACTACTCC 
AATGCAGCAGGCGTGATGCACTACCTCATCCGCGTGGAGCCCTTCACCTCCCTGCACGTCCA 
GCTGCAAAGTGGCCGCTTTGACTGCTCCGACCGGCAGTTCCACTCGGTGGCGGCAGCCTGGC 
AGGCACGCCTGGAGAGCCCTGCCGATGTGAAGGAGCTCATCCCGGAATTCTTCTACTTTCCT 
GACTTCCTGGAGAACCAGAACGGTTTTGACCTGGGCTGTCTCCAGCTGACCAACGAGAAGGT 
AGGCGATGTGGTGCTACCCCCGTGGGCCAGCTCTCCTGAGGACTTCATCCAGCAGCACCGCC 
AGGCTCTGGAGTCGGAGTATGTGTCTGCACACCTACACGAGTGGATCGACCTCATCTTTGGC 
TACAAGCAGCGGGGGCCAGCCGCCGAGGAGGCCCTCAATGTCTTCTATTACTGCACCTATGA 
GGGGGCTGTAGACCTGGACCATGTGACAGATGAGCGGGAACGGAAGGCTCTGGAGGGCATTA 
TCAGCAACTTTGGGCAGACTCCCTGTCAGCTGCTGAAGGAGCCACATCCAACTCGGCTCTCA 
GCTGAGGAAGCAGCCCATCGCCTTGCACGCCTGGACACTAACTCACCTAGCATCTTCCAGCA 
CCTGGACGAACTCAAGGCATTCTTCGCAGAGGTGACTGTGAGTGCCAGTGGGCTGCTGGGCA 
CCCACAGCTGGTTGCCCTATGACCGCAACATAAGCAACTACTTCAGCTTCAGCAAAGACCCC 
ACCATGGGCAGCCACAAGACGCAGCGACTGCTGAGTGGCCCGTGGGTGCCAGGCAGTGGTGT 
GAGTGGACAAGCACTGGCAGTGGCCCCGGATGGAAAGCTGCTATTCAGCGGTGGCCACTGGG 
ATGGCAGCCTGCGGGTGACTGCACTACCCCGTGGCAAGCTGTTGAGCCAGCTCAGCTGCCAC 
CTTGATGTAGTAACCTGCCTTGCACTGGACACCTGTGGCATCTACCTCATCTCAGGCTCCCG 
GGACACCACGTGCATGGTGTGGCGGCTCCTGCATCAGGGTGGTCTGTCAGTAGGCCTGGCAC 
CAAAGCCTGTGCAGGTCCTGTATGGGCATGGGGCTGCAGTGAGCTGTGTGGCCATCAGCACT 
GAACTTGACATGGCTGTGTCTGGATCTGAGGATGGAACTGTGATCATACACACTGTACGCCG 
CGGACAGTTTGTAGCGGCACTACGGCCTCTGGGTGCCACATTCCCTGGACCTATTTTCCACC 
TGGCATTGGGGTCCGAAGGCCAGATTGTGGTACAGAGCTCAGCGTGGGAACGTCCTGGGGCC 
CAGGTCACCTACTCCTTGCACCTGTATTCAGTCAATGGGAAGTTGCGGGCTTCACTGCCCCT 
GGCAGAGCAGCCTACAGCCCTGACGGTGACAGAGGACTTTGTGTTGCTGGGCACCGCCCAGT 
GCGCCCTGCACATCCTCCAACTAAACACACTGCTCCCGGCCGCGCCTCCCTTGCCCATGAAG 
GTGGCCATCCGCAGCGTGGCCGTGACCAAGGAGCGCAGCCACGTGCTGGTGGGCCTGGAGGA 
TGGCAAGCTCATCGTGGTGGTCGCGGGGCAGCCCTCTGAGGTGCGCAGCAGCCAGTTCGCGC 
GGAAGCTGTGGCGGTCCTCGCGGCGCATCTCCCAGGTGTCCTCGGGAGAGACGGAATACAAC 
CCTACTGAGGCGCGCTgAACCTGGCCAGTCCGGCTGCTCGGGCCCCGCCCCCGGCAGGCCTG 
GCCCGGGAGGCCCCGCCCAGAAGTCGGCGGGAACACCCCGGGGTGGGCAGCCCAGGGGGTGA 
GCGGGGCCCACCCTGCCCAGCTCAGGGATTGGCGGGCGATGTTACCCCCTCAGGGATTGGCG 
GGCGGAAGTCCCGCCCCTCGCCGGCTGAGGGGCCGCCCTGAGGGCCAGCACTGGCGTCT 



WO 99/63088 £3/3 AO PCT/US99/12252 

FIGURE 23 

MSQFEMDTYAKSHDLMSGFWNAC¥T)ML 

RYTAVLKQQATQHSMALLHWGALWRQLASPCGAWALRDTPIPRWKLSSAETYSRMRLKLVPN 

HHFDPHLEASALRDNLGEVPLTPTEEASLPLAVTKEAKVSTPPELLQEDQLGEDELAELETP 

MEAAELDEQREKLVLSAECQLVTWAWPGLLEVTTQNVYFYDGSTERVETEEGIGYDFRRP 

LAQLRE VHLRRFNLRRS ALELFF I DQANYFLNFP CKVGTTPVSS PSQTPRPQPGP I PPHTQV 

RNQVYSWLLRLRPPSQGYLSSRSPQEMLRASGLTQKWVQREISNFEYLMQLNTIAGRTYNDL 

SQYPWPWVLQDYVSPTLDLSNPAVFRDLSKPIGVVNPKHAQLVREKYESFEDPAGTIDKFH 

YGTHYSNAAGVMHYLIRVEPFTSLHVQLQSGRFDCSDRQFHSVAAAWQARLESPADVKELIP 

EFFYFPDFLENQNGFDLGCLQLTNEKVGDWLPPWASSPEDFIQQHRQALESEYVSAHLHEW 

IDLI FGYKQRGPAAEEALNVFYYCTYEGAVDLDHVTDERERKALEGI I SNFGQTPCQLLKEP 

HPTRLSAEEAAHRLARLDTNSPSIFQHLDELKAFFAEVTVSASGLLGTHSWLPYDRNISNYF 

SFSKDPTMGSHKTQRLLSGPWVPGSGVSGQALAVAPDGKLLFSGGHWDGSLRVTALPRGKLL 

SQLSCHLDWTCJ^DTCGIYLISGSRDTTCNTVWRLLHQGGLSVGLAPKPVQVLYGHGAAVS 

CVAISTELDMAVSGSEDGTVIIHTVRRGQFVAALRPLGATFPGPIFHLALGSEGQIWQSSA 

WERPGAQVTYSLHLYSVNGKLRASLPLAEQPTALTVTEDFVLLGTAQCALHILQLNTLLPAA 

PPLPMKVAIRSVAVTKERSHVLVGLEDGKLIVWAGQPSEVRSSQFARKLWRSSRRISQVSS 

GETEYNPTEAR 
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CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCC 

CACGGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCAT 

CCAAAGGCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTC 

TGGACCCTTAACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTT 

CTACTGGGCCTTCCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCC 

GCACACTCCGTTACCACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAG 

ATAGCCCGGGTCATCTTGGAGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGC 

CCGCTGCATCATGTGCTGTTTCAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCC 

TAAACCGCAATGCATACATCATGATCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAA 

AATGCGTTCATGCTACTCATGCGAAACATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGA 

CCTGCTGCTGTTCTTTGGGAAGCTGCTGGTGGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTT 

TTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAGACTTTAAGAGCCCCCACCTCAACTATTAC 

TGGCTGCCCATCATGACCTCCATCCTGGGGGCCTATGTCATCGCCAGCGGCTTCTTCAGCGT 

TTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCTGGAAGACCTGGAGCGGAACAACG 

GCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAAAGATTCTGGGCAAGAAGAAC 

GAGGCGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTCCGGCCCTGATCCAGGACTGC 

ACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGTCTCCATTTTGTGGT 

AAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACTTTGAGAGGCTG 

AGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAAACCTCC 

GTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCAGCTAC 

TCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGAGA 

TCGCGCCACTGCACTCCAACCTGGGTGACAGACTCTGTCTCCAAAACAAAACAAACAAACAA 

AAAGATTTTATTAAAGATATTTTGTTAACTC 
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FIGURE 25 

RTRGRTRGGCEKVP I NTSCNPTAHLVNS S C PGLMCVFQGYS S KGL I QRS VFNLQ IYGVLGLF 
WTLNWVLALGQCVLAGAFAS F YWAFHKPQD I PTFPL I S AFI RTLRYHTGSLAFGAL I LTLVQ 
IARVILEYIDHKLRGVQNPVARCIMCCFKCCLWCLEKFIKFLNRNAYIMIAIYGKNFCVSAK 
NAFMLLMRNIVRVVVLDKVTDLLLFFGKLLVVGGVGVLSFFFFSGRIPGLGKDFKSPHLNYY 
WLPIMTSILGAYVIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKN 
EAP PDNKKRKK 



WO 99/63088 ~2 <f /S-^ O PCT/US99/122S2 

FIGURE 26 

GAGTCTTGACCGCCGCCGGGCTCTTGGTACCTCAGCGCGAGCGCCAGGCGTCCGGCCGCCGT 

GGCTATGTTCGTGTCCGATTTCCGCAAAGAGTTCTACGAGGTGGTCCAGAGCCAGAGGGTCC 

TTCTCTTCGTGGCCTCGGACGTGGATGCTCTGTGTGCGTGCAAGATCCTTCAGGCCTTGTTC 

CAGTGTGACCACGTGCAATATACGCTGGTTCCAGTTTCTGGGTGGCAAGAACTTGAAACTGC 

ATTTCTTGAGCATAAAGAACAGTTTCATTATTTTATTCTCATAAACTGTGGAGCTAATGTAG 

ACCTATTGGATATTCTTCAACCTGATGAAGACACTATATTCTTTGTGTGTGACTCCCATAGG 

C CAGTC AATGTCGT CAATGT ATAC AACGAT AC C C AG AT CAAATTACT CAT T AAACAAGATGA 

TGACCTTGAAGTTCCCGCCTATGAAGACATCTTCAGGGATGAAGAGGAGGATGAAGAGCATT 

CAGGAAATGACAGTGATGGGTCAGAGCCTTCTGAGAAGCGCACACGGTTAGAAGAGGAGATA 

GTGGAGCAAACCATGCGGAGGAGGCAGCGGCGAGAGTGGGAGGCCCGGAGAAGAGACATCCT 

CTTTGACTACGAGCAGTATGAATATCATGGGACATCGTCAGCCATGGTGATGTTTGAGCTGG 

CTTGGATGCTGTCCAAGGACCTGAATGACATGCTGTGGTGGGCCATCGTTGGACTAACAGAC 

CAGTGGGTGCAAGACAAGATCACTCAAATGAAATACGTGACTGATGTTGGTGTCCTGCAGCG 

CCACGTTTCCCGCCACAACCACCGGAACGAGGATGAGGAGAACACACTCTCCGTGGACTGCA 

CACGGATCTCCTTTGAGTATGACCTCCGCCTGGTGCTCTACCAGCACTGGTCCCTCCATGAC 

AGCCTGTGCAACACCAGCTATACCGCAGCCAGGTTCAAGCTGTGGTCTGTGCATGGACAGAA 

GCGGCTCCAGGAGTTCCTTGCAGACATGGGTCTTCCCCTGAAGCAGGTGAAGCAGAAGTTCC 

AGGCCATGGACATCTCCTTGAAGGAGAATTTGCGGGAAATGATTGAAGAGTCTGCAAATAAA 

TTTGGGATGAAGGACATGCGCGTGCAGACTTTCAGCATTCATTTTGGGTTCAAGCACAAGTT 

TCTGGCCAGCGACGTGGTCTTTGCCACCATGTCTTTGATGGAGAGCCCCGAGAAGGATGGCT 

CAGGGACAGATCACTTCATCCAGGCTCTGGACAGCCTCTCCAGGAGTAACCTGGACAAGCTG 

TACCATGGCCTGGAACTCGCCAAGAAGCAGCTGCGAGCCACCCAGCAGACCATTGCCAGCTGC 

CTTTGCACCAACCTCGTCATCTCCCAGGGGCCTTTCCTGTACTGCTCTCTCATGGAGGGCAC 

TCCAGATGTCATGCTGTTCTCTAGGCCGGCATCCCTAAGCCTGCTCAGCAAACACCTGCTCA 

AGTCCTTTGTGTGTTCGACAAAGAACCGGCGCTGCAAACTGCTGCCCCTGGTGATGGCTGCC 

CCCCTGAGCATGGAGCATGGCACAGTGACCGTGGTGGGCATCCCCCCAGAGACCGACAGCTC 

GGACAGGAAGAACTTTTTTGGGAGGGCGTTTGAGAAGGCAGCGGAAAGCACCAGCTCCCGGA 

TGCTGCACAACCATTTTGACCTCTCAGTAATTGAGCTGAAAGCTGAGGATCGGAGCAAGTTT 

CTGGACGCACTTATTTCCCTCCTGTCCTAGGAATTTGATTCTTCCAGAATGACCTTCTTATT 

TATGTAACTGGCTTTCATTTAGATTGTAAGTTATGGACATGATTTGAGATGTAGAAGCCATT 

TTTTATTAAATAAAATGCTTATTTTAGGAAA 
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FIGURE 27 

MFVSDFRKEFYEWQSQRVLLFVASDVDALCACKILQALFQCDHVQYTLVPVSGWQELETAF 
LEHKEQFHYFILINCGANVDLLDILQPDEDTIFFVCDSHRPVNWNVYNDTQIKLLIKQDDD 
LEVPAYEDIFRDEEEDEEHSGNDSDGSEPSEKRTRLEEEIVEQTMRRRQRREWEARRRDILF 
DYEQYEYHGTSSAMVMFELAWMLSKDLNDMLW^^ 

VSRHNHRNEDEENTLSVDCTRISFEYDLRLVLYQHWSLHDSLCNTSYTAARFKLWSVHGQKR 
LQEFLADMGLPLKQVKQKFQAMDISLKENLREMIEESANKFGMKDMRVQTFSIHFGFKHKFL 
ASDWFATMSLMESPEKDGSGTDHFIQALDSLSRSNLDKLYHGLELAKKQLRATQQTIASCL 
CTNLVISQGPFLYCSLMEGTPDVMLFSRPASLSLLSKHLLKSFVCSTKNRRCKLLPLVMAAP 
LSMEHGTVTWGIPPETDSSDRKNFFGRAFEKAAESTSSRMLHNHFDLSVIELKAEDRSKFL 
DALISLLS 
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FIGURE 28 

GTACCTCAGCGCGAGCGCCAGGCGTCCGGCCGCCGTGGCTATGNTCGTGTCCGATTTCCGCA 
AAGAGTTCTACGAGGTGGTCCAGAGCCAGAGGGTCCTTCTCTTCGTGGCCTCGGANGTGGAT 
GCTCTGTGTGCGTGCAAGATCCTTCAGGCCTTGTTCCAGTGTGACCANGTGCAATATANGCT 
GGTTCCAGTTTCTGGGTGGCAAGAACTTGAAACTGCATTTCTTGAGCATAAAGAACAGTTTC 
ATTATTTTATTCTCATAAACTGTGGAGCTAATGTAGACCTATTGGATATTCTTCAACCTGAT 
GAAGACACTATATTCTTTGTGTGTGACACCCATAGGCCAGTCAATGTTGTCAATGTATACAA 
CGATACCC 
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FIGURE 29 

CAGGAACCCTCTCTTTGGGTCTGGATTGGGACCCCTTTCCAGTACCATTTTTTCTAGTGAAC 
CACGAAGGGACGATACCAGAAAACACCCTCAACCCAAAGGAAATAGACTACAGCCCCAATTG 
GCTGACTTTGGCTATAGAAAAAAGAAAGGAACGAAAAGAGACAGTTTTTTTTGGAAAGCTAA 
GTCTTCCCTTTATCGAGTCAAGAAACCCCCCCTTCTTGAGCTATTTACAGCTTTTAACAATT 
GAGTAAAGTACGCTCCGGTCACCATGGTGACAGCCGCCCTGGGTCCCGTCTGGGCAGCGCTC 
CTGCTCTTTCTCCTGATGTGTGAGATCCGTATGGTGGAGCTCACCTTTGACAGAGCTGTGGC 
CAGCGGCTGCCAACGGTGCTGTGACTCTGAGGACCCCCTGGATCCTGCCCATGTATCCTCAG 
CCTCTTCCTCCGGCCGCCCCCACGCCCTGCCTGAGATCAGACCCTACATTAATATCACCATC 
CTGAAGGGTGACAAAGGGGACCCAGGCCCAATGGGCCTGCCAGGGTACATGGGCAGGGAGGG 
TCCCCAAGGGGAGCCTGGCCCTCAGGGCAGCAAGGGTGACAAGGGGGAGATGGGCAGCCCCG 
GCGCCCCGTGCCAGAAGCGCTTCTTCGCCTTCTCAGTGGGCCGCAAGACGGCCCTGCACAGC 
GGCGAGGACTTCCAGACGCTGCTCTTCGAAAGGGTCTTTGTGAACCTTGATGGGTGCTTTGA 
CATGGCGACCGGCCAGTTTGCTGCTCCCCTGCGTGGCATCTACTTCTTCAGCCTCAATGTGC 
ACAGCTGGAATTACAAGGAGACGTACGTGCACATTATGCATAACCAGAAAGAGGCTGTCATC 
CTGTACGCGCAGCCCAGCGAGCGCAGCATCATGCAGAGCCAGAGTGTGATGCTGGACCTGGC 
CTACGGGGACCGCGTCTGGGTGCGGCTCTTCAAGCGCCAGCGCGAGAACGCCATCTACAGCA 
ACGACTTCGACACCTACATCACCTTCAGCGGCCACCTCATCAAGGCCGAGGACGACTGAGGG 
CCTCTGGGCCACCCTCCCGGCTGGAGAGCTCAGGTGCTGGTCCCGTCCCCTGCAGGGCTCAG 
TTTGCACTGCTGTGAAGCAGGAAGGCCAGGGAGGTCCCCGGGGACCTGGCATTCTGGGGAGA 
CCCTGCTTCTATCTTGGCTGCCATCATCCCTCCCAGCCTATTTCTGCTCCTCTCTTCTCTCT 
TGGACCTATTTTAAGAAGCTTGCTAACCTAAATATTCTAGAACTTTCCCAGCCTCGTAGCCC 
AGCACTTCTCAAACTTGGAAATGCATGCGAATCACCCGGGGTTCGTGTTAAATGCAGATTCT 
GACTCAGCAGGTCTGAGTGGGTCCAGGATTCTGTGTTTCTCATATGTTCCTGGGTGATGCTG 
ATGGGGTCAGTCTATGAACCACACTGGAGCAACCAGGTTCTAGGACTTTCTCAATATTCTAG 
TACTTTCTGAACATTCTGGAATCCTCCCCACATTCTAGAATTCTCCCAACATTTTTTTTTCT 
TGAGACAGAGTCTTGCTCTGTTGCCCAGGCTAGAGTGCAGTGGTGCAATCTCAGTTCACTGC 
AACCTCTGCCTCCCGGGTTCAAGCGATTCTTCTGCCTCAGCCTCCCTAGTGGCTGGGATTAC 
AGGCGCCTGCTACCATGCCTGGCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCATA 
TTGGCCAGGCTGGTCTTGAACTCCTGACTTCAGGTGACCCACCCGCCTCGGCCTCTCAAAAT 
GCTGGGATTACAGGTGTGAGCCACCGTGCCTGGCCAATTCCAACATTCTTAAATTCTCTCAT 
CCCTCCAGGGCTCCCCGTGCTATGTTCTCTTTACCCCTTCCCCCTCTTCTCTTGCTCAGGCC 
TGCACCACTGCAGCCACCGTTCATTTATTCATTCATTAAACACTGAGCACTCACTCTGTGCT 
GGGTCCCGGGAAGGGTGAGGGGGTCAGACACAGGCCCTGCCCCTGCCCTCAGTGACTGGCCA 
GTCCAGCCCAGGCGGGGAGAGATGTGTACATAGGTTTTAAAGCAGACCCAGAGCTCATGGGG 
GCCTGTGTTCTGGGTGTTCAGGTGCTGCTGGTCCTCCATTACCCACTGCTCCCCAAGGCTGG 
TGGGACGGGGTCCCGGTGGCAGGGGCAGGTATCTCCTTCCCGTTCCTCATCCACCTGCCCAG 
TGCTCATCGTTACAGCAAACCCCAGGGGGCCTTGGCCAGGTCAAGGGTTCTGTGAGGAGAGG 
ACCCAGGAGTGTGGGGGCATTTGGGGGGTGAAGTGGCCCCCGAAGAATGGAACCCACACCCA 
TAGCTCTCCCCACAGCTGATACGGCATCCTGCGAGAAGACCTGCCCTCCTCACTGGGATCCC 
CTTCCTGCCTCCTCCCAGGGCTCTGCCAGGGCCTTGCTCAGTCCCTTCCACCAAAGTCATCT 
GAACTTCCGTTTCCCCAGGGCCTCCAGCTGCCCTCAGACACTGATGTCTGTCCCCAGGTGCT 
CTCTGCCCCTCATGCCCCTCTCACCGGCCCAGTGCCCCGACTCTCCAGGCTTTATCAAGGTG 
CTAAGGCCCGGGTGGGCAGCTCCTCGTCTCAGAGCCCTCCTCCGGCCTGGTGCTGCCTTTAC 
AAACACCTGCAGGAGAAGGGCCACGGAAGCCCCAGGCTTTAGAGCCCTCAGCAGGTCTGGGG 
AGCTAGAGCAAAGGAGGGACCTCAGGCCTTCCGTTTCTTCTTCCAGGGTGGGGTGGCCTGGT 
GTTCCCCTAGCCTTCCAAACCCAGGTGGCCTGCCCTTCTCCCCAGAGGGAGGCGGCCTCCGC 
CCATTGGTGCTCATGCAGACTCTGGGGCTGAGGTGCCCCGGGGGGTGATCTCTGGTGCTCAC 
AGCCGAGGGAGCCGTGGCTCCATGGCCAGATGACGGAAACAGGGTCTGACCAAGTGCCAGGA 
AGACCTGTGCTATAAACCACCCTGCCTGATCCTGCCCCTGCCTGACCCCGCCACGCCCTGCC 
GTCCAGCATGATTAAAGAATGCTGTCTCCTCTTGGAAAAAAAAAAAAAAAA 



WO99«30M PCT/USJWIHB 

FIGURE 30 

MVTAMjGPVWAALLLFLLMCEIRMVELTFDRAVASGCQRCCDSEDPLDPAHVSSASSSGRPH 
ALPEIRPYINITILKGDKGDPGPMGLPGYMGREGPQGEPGPQGSKGDKGEMGSPGAPCQKRF 
FAFSVGRKTALHSGEDFQTLLFERVFVNLDGCFDMATGQFAAPLRGIYFFSLNVHSWNYKET 
YVHIMHNQKEAVILYAQPSERSIMQSQSVMLDLAYGDRVWVRLFKRQRENAIYSNDFDTYIT 
FSGHL I KAEDD 



Important features: 
Signal peptide: 

amino acids 1-20 

N-glycosylation site. 

amino acids 72-75 

Clq domain proteins. 

amino acids 144-178, 78-111 and 84-117 
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ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTG 
AGGCCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTG 
CTCCAGGGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGG 
GCTGCCCATCCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGG 
ACCACGGTGCCCCGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCAT 
CACCATGCTCGACGCAGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCAC 
AGAACCAGATCGCCAGCCTGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTG 
GACCTGACGGCCAACAGGCTGCATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCT 
CGAGCGCCTCTACCTGGGCAAGAACCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGC 
TCGACCGCCTCCTGGAGCTCAAGCTGCAGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGC 
CTGCCCCGCCTGCTGCTGCTGGACCTCAGCCACAACAGCCTCCTGGCCCTGGAGCCCGGCAT 
CCTGGACACTGCCAACGTGGAGGCGCTGCGGCTGGCTGGTCTGGGGCTGCAGCAGCTGGACG 
AGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACCTGGATGTGTCCGACAACCAGCTGGAG 
CGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACGCGCCTGCGGCTGGCCGGCAACAC 
CCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGCTGCCCTGCAGGAGCTGGATG 
TGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCCTCTTCCCCCGCCTGCGG 
CTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGCTGGTTTGGCCCCTG 
GGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCACTTCCCGCCCA 
AGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAGCCACCACC 
ACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCTTCTAG 
CTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTCCA 
CTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTC 
AATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTT 
CACGGGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCA 
CGCCGAGGCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGC 
GTGGGGCTGCAGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTA 
TCGCAACCTATCGGGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTG 
AGTACACGGTCACCCAGCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGG 
CCCGGGCGGGTGCCGGAGGGCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCA 
CTCCAACCACGCCCCAGTCACCCAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCG 
CCCTGGCCGCGGTGCTCCTGGCCGCGCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGG 
GGGCGGGCCATGGCAGCAGCGGCTCAGGACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCT 
GGAACTGGAGGGAGTGAAGGTCCCCTTGGAGCCAGGCCCGAAGGCAACAGAGGGCGGTGGAG 
AGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCCACTCATGGGCTTCCCAGGGCCTGGCCTC 
CAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCAGAGAGAGACAGGGCAGCTGGGGCCG 
GGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCCACACCACGTAAGTTCTCAGTCC 
CAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCTGGGCCCTGTTCCCTCTGGA 
CCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCCCTAACGTCCCCAGAAC 
CGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTCCCTGGGCACGGCG 
GGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCACTCCAGGCGGA 
CCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGCGGCTGTG 
TGACTCTAGTCTTGGCCCCAGGAAGCGAAGGAACAAAAGAAACTGGAAAGGAAGATGCTTTA 
GGAACATGTTTTGCTTTTTTAAAATATATATATTTATAAGAGATCCTTTCCCATTTATTCTG 
GGAAGATGTTTTTCAAACTCAGAGACAAGGACTTTGGTTTTTGTAAGACAAACGATGATATG 
AAGGCCTTTTGTAAGAAAAAATAAAAGATGAAGTGTGAAA 
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FIGURE 32 

MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFEN 
GITMLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLR 
RLERLYLGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEP 
GILDTANVEALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAG 
NTRIAQLRPEDLAGIiAALQELDVSNLSLQALPGDLSGLFPRLRLLAAARNPFNCVCPLSWFG 
PWVRESHVTLASPEETRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALS 
SSLAPTWLSPTAPATEAPSPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPE 
GFTGLYCESQMGQGTRPSPTPVTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRL 
TYRNLSGPDKRLVTLRLPASLAEYTVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPA 
VHSNHAPVTQAREGNLPLLIAPALAAVLI^^ 

PLELEGVKVPLEPGPKATEGGGEALPSGSECEVPLMGFPGPGLQSPLHAKPYI 
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FIGURE 33 

GAATCATCCACGCACCTGCAGCTCTGCTGAGAGAGTGCAAGCCGTGGGGGTTTTGAGCTCAT 
CTTCATCATTCATATGAGGAAATAAGTGGTAAAATCCTTGGAAATACA ATGA GACTCATCAG 
AAACATTTACATATTTTGTAGTATTGTTATGACAGCAGAGGGTGATGCTCCAGAGCTGCCAG 
AAGAAAGGGAACTGATGACCAACTGCTCCAACATGTCTCTAAGAAAGGTTCCCGCAGACTTG 
ACCCCAGCCACAACGACACTGGATTTATCCTATAACCTCCTTTTTCAACTCCAGAGTTCAGA 
TTTTCATTCTGTCTCCAAACTGAGAGTTTTGATTCTATGCCATAACAGAATTCAACAGCTGG 
ATCTCAAAACCTTTGAATTCAACAAGGAGTTAAGATATTTAGATTTGTCTAATAACAGACTG 
AAGAGTGTAACTTGGTATTTACTGGCAGGTCTCAGGTATTTAGATCTTTCTTTTAATGACTT 
TGACACCATGCCTATCTGTGAGGAAGCTGGCAACATGTCACACCTGGAAATCCTAGGTTTGA 
GTGGGGCAAAAATACAAAAATCAGATTTCCAGAAAATTGCTCATCTGCATCTAAATACTGTC 
TTCTTAGGATTCAGAACTCTTCCTCATTATGAAGAAGGTAGCCTGCCCATCTTAAACACAAC 
AAAACTGCACATTGTTTTACCAATGGACACAAATTTCTGGGTTCTTTTGCGTGATGGAATCA 
AGACTTCAAAAATATTAGAAATGACAAATATAGATGGCAAAAGCCAATTTGTAAGTTATGAA 
ATGCAACGAAATCTTAGTTTAGAAAATGCTAAGACATCGGTTCTATTGCTTAATAAAGTTGA 
TTTACTCTGGGACGACCTTTTCCTTATCTTACAATTTGTTTGGCATACATCAGTGGAACACT 
TTCAGATCCGAAATGTGACTTTTGGTGGTAAGGCTTATCTTGACCACAATTCATTTGACTAC 
TCAAATACTGTAATGAGAACTATAAAATTGGAGCATGTACATTTCAGAGTGTTTTACATTCA 
ACAGGATAAAATCTATTTGCTTTTGACCAAAATGGACATAGAAAACCTGACAATATCAAATG 
CACAAATGCCACACATGCTTTTCCCGAATTATCCTACGAAATTCCAATATTTAAATTTTGCC 
AATAATATCTTAACAGACGAGTTGTTTAAAAGAACTATCCAACTGCCTCACTTGAAAACTCT 
CATTTTGAATGGCAATAAACTGGAGACACTTTCTTTAGTAAGTTGCTTTGCTAACAACACAC 
CCTTGGAACACTTGGATCTGAGTCAAAATCTATTACAACATAAAAATGATGAAAATTGCTCA 
TGGCCAGAAACTGTGGTCAATATGAATCTGTCATACAATAAATTGTCTGATTCTGTCTTCAG 
GTGCTTGCCCAAAAGTATTCAAATACTTGACCTAAATAATAACCAAATCCAAACTGTACCTA 
AAGAGACTATTCATCTGATGGCCTTACGAGAACTAAATATTGCATTTAATTTTCTAACTGAT 
CTCCCTGGATGCAGTCATTTCAGTAGACTTTCAGTTCTGAACATTGAAATGAACTTCATTCT 
CAGCCCATCTCTGGATTTTGTTCAGAGCTGCCAGGAAGTTAAAACTCTAAATGCGGGAAGAA 
ATCCATTCCGGTGTACCTGTGAATTAAT^AAATTTCATTCAGCTTGAAACATATTCAGAGGTC 
ATGATGGTTGGATGGTCAGATTCATACACCTGTGAATACCCTTTAAACCTAAGGGGAACTAG 
GTTAAAAGACGTTCATCTCCACGAATTATCTTGCAACACAGCTCTGTTGATTGTCACCATTG 
TGGTTATTATGCTAGTTCTGGGGTTGGCTGTGGCCTTCTGCTGTCTCCACTTTGATCTGCCC 
TGGTATCTCAGGATGCTAGGTCAATGCACACAAACATGGCACAGGGTTAGGAAAACAACCCA 
AGAACAACT C AAGAGAAATG TC CGATTC CACGCATTT ATTT CAT ACAGTGAA CATG AT TC T C 
TGTGGGTGAAGAATGAATTGATCCCCAATCTAGAGAAGGAAGATGGTTCTATCTTGATTTGC 
CTTTATGAAAGCTACTTTGACCCTGGCAAAAGCATTAGTGAAAATATTGTAAGCTTCATTGA 
GAAAAGCTATAAGTCCATCTTTGTTTTGTCTCCCAACTTTGTCCAGAATGAGTGGTGCCATT 
ATGAATTCTACTTTGCCCACCACAATCTCTTCCATGAAAATTCTGATCATATAATTCTTATC 
TTACTGGAACCCATTCCATTCTATTGCATTCCCACCAGGTATCATAAACTGAAAGCTCTCCT 
GGAAAAAAAAGCATACTTGGAATGGCCCAAGGATAGGCGTAAATGTGGGCTTTTCTGGGCAA 
ACCTTCGAGCTGCTATTAATGTTAATGTATTAGCCACCAGAGAAATGTATGAACTGCAGACA 
TTCACAGAGTTAAATGAAGAGTCTCGAGGTTCTACAATCTCTCTGATGAGAACAGATTGTCT 
ATAAAATCCCACAGTCCTTGGGAAGTTGGGGACCACATACACTGTTGGGATGTACATTGATA 
CAACCTTTATGATGGCAATTTGACAATATTTATTAAAATAAAAAATGGTTATTCCCTTCATA 
TCAGTTTCTAGAAGGATTTCTAAGAATGTATCCTATAGAAACACCTTCACAAGTTTATAAGG 
GCTTATGGAAAAAGGTGTTCATCCCAGGATTGTTTATAATCATGAAAAATGTGGCCAGGTGC 
AGTGGCTCACTCTTGTAATCCCAGCACTATGGGAGGCCAAGGTGGGTGACCCACGAGGTCAA 
GAGATGGAGACCATCCTGGCCAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTA 
GCTGGGCGTGATGGTGCACGCCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCG 
CTTGAACCCGGGAGGTGGCAGTTGCAGTGAGCTGAGATCGAGCCACTGCACTCCAGCCTGGT 
GACAGAGCGAGACTCCATCTCAAAAAAAAGAAAAAAAAAAAAGAAAAAAATGGAAAACATCC 
TCATGGCCACAAAATAAGGTCTAATTCAATAAATTATAGTACATTAATGTAATATAATATTA 
CATGCCACTAAAAAGAATAAGGTAGCTGTATATTTCCTGGTATGGAAAAAACATATTAATAT 
GTTATAAACTATTAGGTTGGTGCAAAACTAATTGTGGTTTTTGCCATTGAAATGGCATTGAA 
ATAAAAGTGTAAAGAAATCTATACCAGATGTAGTAACAGTGGTTTGGGTCTGGGAGGTTGGA 
TTACAGGGAGCATTTGATTTCTATGTTGTGTATTTCTATAATGTTTGAATTGTTTAGAATGA 
ATCTGTATTTCTTTTATAAGTAGAAAAAAAATAAAGATAGTTTTTACAGCCT 
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MRLIRNIYIFCSIVMTAEGDAPELPEERELMTNCSNMSLRKVPADLTPATTTLDLSYNLLFQ 
LQSSDFHSVSKLRVLILCHNRIQQLDLKTFEFNKELRYLDLSNNRLKSVTWYLLAGLRYLDL 
SFNDFDTMPICEEAGNMSHLEILGLSGAKIQKSDFQKIAHLHLNTVFLGFRTLPHYEEGSLP 
ILNTTKLHIVLPMDTNFWVLLRDGIKTSKILEMTNIDGKSQFVSYEMQRNLSLENAKTSVLL 
LNKVDLLVTODLFLILQFVWHTSVEHFQI 

VFYIQQDKIYLLLTKMDIENLTISNAQMPHMLFPNYPTKFQYLNFANNILTDELFKRTIQLP 

HLKTLILNGNKLETLSLVSCFANNTPLEHLDLSQNLLQHKNDENCSWPETVVNMNLSYNKLS 

DSVFRCLPKSIQILDLNNNQIQTVPKETIHLMALRELNIAFNFLTDLPGCSHFSRLSVLNIE 

l^FILSPSLDFVQSCQEVKTLNAGRNPFRCTCELKNFIQLETYSEVMMVGWSDSYTCEYPLN 

LRGTRLKDVHLHELSCNTALLIVTIWIMLVLGLAVAFCCLHFDLPWYLRMLGQCTQ 

RKTTQEQLKRNVRFHAFISYSEHDSLWVKNELIPNLEKEDGSILICLYESYFDPGKSISENI 

VS F IEKSYKS I FVLSPNF VQNEWCHYEF YF AHHNLFHENSDH 1 1 L I LLE P I P F Y C I PTRYHK 

LKALLEKKAYLEWPKDRRKCGLFWANLRAAINVNVLATREMYELQTFTELNEESRGSTISLM 

RTDCL 
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GGGGGCTTTCTTGGGCTTGGCTGCTTGGAACACCTGCCTCCAAGGACCGGCCTCGGAGGGGT 
CGCCGGGAAAGGGAGGGAAGAAGGAAGGGCGGGGCCGGCCCCCCTGCGCCCGCCCCGCGCCT 
CTGCGCGCCCCTGTCCGCCCCGGCCCAGCCCAGCCCAGCCCCGCGGGCCGGTCACACGCGCA 
GCCAGCCGGCCGCCTCCCGCGCCCAAGCGCGCCGCTCTGCTGTGCCCTGCGCCCTTGCCCCG 
CGCCAGCTTCTGCGCCCGCAGCCCGCCCGGCGCCCCCGGTGACCGTGACCCTGCCCTGGGCG 
CGGGGCGGAGCAGGCATGTCCCGCCCGGGGACCGCTACCCCAGCGCTGGCCCTGGTGCTCCT 
GGCAGTGACCCTGGCCGGGGTCGGAGCCCAGGGCGCAGCCCTCGAGGACCCTGATTATTACG 
GGCAGGAGATCTGGAGCCGGGAGCCCTACTACGCGCGCCCGGAGCCCGAGCTCGAGACCTTC 
TCTCCGCCGCTGCCTGCGGGGCCCGGGGAGGAGTGGGAGCGGCGCCCGCAGGAGCCCAGGCC 
GCCCAAGAGGGCCACCAAGCCCAAGAAAGCTCCCAAGAGGGAGAAGTCGGCTCCGGAGCCGC 
CTCCACCAGGTAAACACAGCAACAAAAAAGTTATGAGAACCAAGAGCTCTGAGAAGGCTGCC 
AACGATGATCACAGTGTCCGTGTGGCCCGTGAAGATGTCAGAGAGAGTTGCCCACCTCTTGG 
TCTGGAAACCTTAAAAATCACAGACTTCCAGCTCCATGCCTCCACGGTGAAGCGCTATGGCC 
TGGGGGCACATCGAGGGAGACTCAACATCCAGGCGGGCATTAATGAAAATGATTTTTATGAC 
GGAGCGTGGTGCGCGGGAAGAAATGACCTCCAGCAGTGGATTGAAGTGGATGCTCGGCGCCT 
GACCAGATTCACTGGTGTCATCACTCAAGGGAGGAACTCCCTCTGGCTGAGTGACTGGGTGA 
CATCCTATAAGGTCATGGTGAGCAATGACAGCCACACGTGGGTCACTGTTAAGAATGGATCT 
GGAGACATGATATTTGAGGGAAACAGTGAGAAGGAGATCCCTGTTCTCAATGAGCTACCCGT 
CCCCATGGTGGCCCGCTACATCCGCATAAACCCTCAGTCCTGGTTTGATAATGGGAGCATCT 
GCATGAGAATGGAGATCCTGGGCTGCCCACTGCCAGATCCTAATAATTATTATCACCGCCGG 
AACGAGATGACCACCACTGATGACCTGGATTTTAAGCACCACAATTATAAGGAAATGCGCCA 
GTTGATGAAAGTTGTGAATGAAATGTGTCCCAATATCACCAGAATTTACAACATTGGAAAAA 
GCCACCAGGGCCTGAAGCTGTATGCTGTGGAGATCTCAGATCACCCTGGGGAGCATGAAGTC 
GGTGAGCCCGAGTTCCACTACATCGCGGGGGCCCACGGCAATGAGGTGCTGGGCCGGGAGCT 
GCTGCTGCTGCTGGTGCAGTTCGTGTGTCAGGAGTACTTGGCCCGGAATGCGCGCATCGTCC 
ACCTGGTGGAGGAGACGCGGATTCACGTCCTCCCCTCCCTCAACCCCGATGGCTACGAGAAG 
GCCTACGAAGGGGGCTCGGAGCTGGGAGGCTGGTCCCTGGGACGCTGGACCCACGATGGAAT 
TGACATCAACAACAACTTTCCTGATTTAAACACGCTGCTCTGGGAGGCAGAGGATCGACAGA 
ATGTCCCCAGGAAAGTTCCCAATCACTATATTGCAATCCCTGAGTGGTTTCTGTCGGAAAAT 
GCCACGGTGGCTGCCGAGACCAGAGCAGTCATAGCCTGGATGGAAAAAATCCCTTTTGTGCT 
GGGCGGCAACCTGCAGGGCGGCGAGCTGGTGGTGGCGTATCCCTACGACCTGGTGCGGTCCC 
CCTGGAAGACGCAGGAACACACCCCCACCCCCGATGACCACGTGTTCCGCTGGCTGGCCTAC 
TCCTATGCCTCCACACACCGCCTCATGACAGACGCCCGGAGGAGGGTGTGCCACACGGAGGA 
CTTCCAGAAGGAGGAGGGCACTGTCAATGGGGCCTCCTGGCACACCGTCGCTGGAAGTCTGA 
ACGATTTCAGCTACCTTCATACAAACTGCTTCGAACTGTCCATCTACGTGGGCTGTGATAAA 
TACCCACATGAGAGCCAGCTGCCCGAGGAGTGGGAGAATAACCGGGAATCTCTGATCGTGTT 
CATGGAGCAGGTTCATCGTGGCATTAAAGGCTTGGTGAGAGATTCACATGGAAAAGGAATCC 
CAAACGCCATTATCTCCGTAGAAGGCATTAACCATGACATCCGAACAGCCAACGATGGGGAT 
TACTGGCGCCTCCTGAACCCTGGAGAGTATGTGGTCACAGCAAAGGCCGAAGGTTTCACTGC 
ATCCACCAAGAACTGTATGGTTGGCTATGACATGGGGGCCACAAGGTGTGACTTCACACTTA 
GCT^AAACCAACATGGCCAGGATCCGAGAGATCATGGAGAAGTTTGGGAAGCAGCCCGTCAGC 
CTGCCAGCCAGGCGGCTGAAGCTGCGGGGGCGGAAGAGACGACAGCGTGGGTGACCCTCCTG 
GGCCCTTGAGACTCGTCTGGGACCCATGCAAATTAAACCAACCTGGTAGTAGCTCCATAGTG 
GACTCACTCACTGTTGTTTCCTCTGTAATTCAAGAAGTGCCTGGAAGAGAGGGTGCATTGTG 
AGGCAGGTCCCAAAAGGGAAGGCTGGAGGCTGAGGCTGTTTTCTTTTCTTTGTTCCCATTTA 
TCCAAATAACTTGGACAGAGCAGCAGAGAAAAGCTGATGGGAGTGAGAGAACTCAGCAAGCC 
AACCTGGGAATCAGAGAGAGAAGGAGAAGGAGGGGAGCCTGTCCGTTCAGAGCCTCTGGCTGC 
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ATAGAAAAGGATTCTGGTGCTTCCCCTGTTTGCGTGGCAGCAAGGGTTCCACGTGCATTTGC 
AATTTGCACAGCTAAAATTGCAGCATTTCCCCAGCTGGGCTGTCCCAAATGTTACCATTTGA 
GATGCTCCCAGGCGTCCTAAGAGAATCCACCCTCTCTGGCCCTGGGACATTGCAAGCTGCTA 
CAAATAAATTCTGTGTTCTTTTGACAATAGCGTCATTGCCAAGTGCACATCAGTGAGCCTCT 
TGAATCTGTTTAGTCTCCTTTTTCAACAAAGGAGTGTGTTCAGAAAAGGAGAGAGAGGCTGA 
GATCATTCAGGAGTTTGTTGGGCAGCAAGCATGGAGCTTCTTGCACAAATTCTGGGTCCATA 
AACAACCCCCAAAGTCCCTGCTGATCCAGTAGCCCTGGAGGTTCCCCAGGTAGGGAGAGCCA 
GAGGTGCCAGCCTTCCTGAAGGGCCAGAAAATTTAGCCTGGATCTCCTCTTTTACCTGCTAG 
GACTGGAAAGAGCCAGAAGTGGGGTGGCCTGAAGCCCTCTCTCTGCTTGAGGTATTGCCCCT 
GTGTGGAATTGAGTGCTCATGGGTTGGCCTCATATCAGCCTGGGAGTTATTTTTGATATGTA 
GAATGCCAGATCTTCCAGATTAGGCTAAATGTAATGAAAACCTCTTAGGATTATCTGTGGAG 
CATCAGTTTGGGAAGAATTATTGAATTATCTTGCAAGAAAAAAGTATGTCTCACTTTTTGTT 
AATGTTGCTGCCTCATTGACCTGGGAAAAATGAAAAAAAAAAATAAAGCAAATGGTAAGACC 
CTTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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MSRPGTATPAIiALVLLAVTLAGVGAQGAALEDPDYYGQEIWSREPYYARPEPELETFSPPLP 

AGPGEEWERRPQEPRPPKRATKPKKAPKREKSAPEPPPPGKHSNKKVMRTKSSEKAANDDHS 

VRVAREDVRESCPPLGLETLKITDFQLHASTVKRYGLGAHRGRLNIQAGINENDFYDGAWCA 

GROTLQQWIEVDARRLTRFTGVITQGRNSLWLSDWVTSYKVMVSNDSHTWVTVKNGSGDMIF 

EGNSEKEIPVLNELPVP1WARYIRINPQSWFDNGSICMRMEILGCPLPDPNNYYHRRNEMTT 

TDDLDFKHHNYKEMRQLMKVVNEMCPNITRIYNIGKSHQGLKLYAVEISDHPGEHEVGEPEF 

HYIAGAHGNEVLGRELLLLLVQFVCQEYLARNARIVHLVEETRIHVLPSLNPDGYEK^^ 

SELGGWSLGRWTHDGIDINNNFPDLNTLLWEAEDRQNVPRKVPNHYIAIPEWFLSENATVAA 

ETRAVIAWMEKIPFVLGGNLQGGELVVAYPYDLVRSPWKTQEHTPTPDDHVFRWLAYSYAST 

HRLMTDARRRVCHTEDFQKEEGTVNGASWHTVAGSLNDFSYLHTNCFELSIYVGCDKYPHES 

QLPEEWENNRESLIVFMEQVHRGIKGLVRDSHGKGI PNAI ISVEGINHDIRTANDGDYWRLL 

NPGEYWTAKAEGFTASTKNCMVGYDMGATRCDFTLSKTNMARIREIMEKFGKQPVSLPARR 

LKLRGRKRRQRG 
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CTAAGAGGACAAGATGAGGCCCGGCCTCTCATTTCTCCTAGCCCTTCTGTTCTTCCTTGGCC 

AAGCTGCAGGGGATTTGGGGGATGTGGGACCTCCAATTCCCAGCCCCGGCTTCAGCTCTTTC 

CCAGGTGTTGACTCCAGCTCCAGCTTCAGCTCCAGCTCCAGGTCGGGCTCCAGCTCCAGCCG 

CAGCTTAGGCAGCGGAGGTTCTGTGTCCCAGTTGTTTTCCAATTTCACCGGCTCCGTGGATG 

ACCGTGGGACCTGCCAGTGCTCTGTTTCCCTGCCAGACACCACCTTTCCCGTGGACAGAGTG 

GAACGCTTGGAATTCACAGCTCATGTTCTTTCTCAGAAGTTTGAGAAAG/^ACTTTCTAAAGT 

GAGGG AATATGTCCAATTAATTAGTGTGTATGAAAAGAAACTGTTAAACCTAACTGTCCGAA 

TTGACATCATGGAGAAGGATACCATTTCTTACACTGAACTGGACTTCGAGCTGATCAAGGTA 

GAAGTGAAGGAGATGGAAAAACTGGTCATACAGCTGAAGGAGAGTTTTGGTGGAAGCTCAGA 

AATTGTTGACCAGCTGGAGGTGGAGATAAGAAATATGACTCTCTTGGTAGAGAAGCTTGAGA 

CACTAGACAAAAACAATGTCCTTGCCATTCGCCGAGAAATCGTGGCTCTGAAGACCAAGCTG 

AAAGAGTGTGAGGCCTCTAAAGATCAAAACACCCCTGTCGTCCACCCTCCTCCCACTCCAGG 

GAGCTGTGGTCATGGTGGTGTGGTGAACATCAGCAAACCGTCTGTGGTTCAGCTCAACTGGA 

GAGGGTTTTCTTATCTATATGGTGCTTGGGGTAGGGATTACTCTCCCCAGCATCCAAACAAA 

GGACTGTATTGGGTGGCGCCATTGAATACAGATGGGAGACTGTTGGAGTATTATAGACTGTA 

CAACACACTGGATGATTTGCTATTGTATATAAATGCTCGAGAGTTGCGGATCACCTATGGCC 

AAGGTAGTGGTACAGCAGTTTACAACAACAACATGTACGTCAACATGTACAACACCGGGAAT 

ATTGCCAGAGTTAACCTGACCACCAACACGATTGCTGTGACTCAAACTCTCCCTAATGCTGC 

CTATAATAACCGCTTTTCATATGCTAATGTTGCTTGGCAAGATATTGACTTTGCTGTGGATG 

AGAATGGATTGTGGGTTATTTATTCAACTGAAGCCAGCACTGGTAACATGGTGATTAGTAAA 

CTCAATGACACCACACTTCAGGTGCTAAACACTTGGTATACCAAGCAGTATAAACCATCTGC 

TTCTAACGCCTTCATGGTATGTGGGGTTCTGTATGCCACCCGTACTATGAACACCAGAACAG 

AAGAGATTTTTTACTATTATGACACAAACACAGGGAAAGAGGGCAAACTAGACATTGTAATG 

CATAAGATGCAGGAAAAAGTGCAGAGCATTAACTATAACCCTTTTGACCAGAAACTTTATGT 

CTATAACGATGGTTACCTTCTGAATTATGATCTTTCTGTCTTGCAGAAGCCCCAGTAAGCTG 

TTTAGGAGTTAGGGTGAAAGAGAAAATGTTTGTTGAAAAAATAGTCTTCTCCACTTACTTAG 

ATATCTGCAGGGGTGTCTAAAAGTGTGTTCATTTTGCAGCAATGTTTAGGTGCATAGTTCTA 

CCACACTAGAGATCTAGGACATTTGTCTTGATTTGGTGAGTTCTCTTGGGAATCATCTGCCT 

CTTCAGGCGCATTTTGCAATAAAGTCTGTCTAGGGTGGGATTGTCAGAGGTCTAGGGGCACT 

GTGGGCCTAGTGAAGCCTACTGTGAGGAGGCTTCACTAGAAGCCTTAAATTAGGAATTAAGG 

AACTTAAAACTCAGTATGGCGTCTAGGGATTCTTTGTACAGGAAATATTGCCCAATGACTAG 

TCCTCATCCATGTAGCACCACTAATTCTTCCATGCCTGGAAGAAACCTGGGGACTTAGTTAG 

GTAGATTAATATCTGGAGCTCCTCGAGGGACCAAATCTCCAACTTTTTTTTCCCCTCACTAG 

CACCTGGAATGATGCTTTGTATGTGGCAGATAAGTAAATTTGGCATGCTTATATATTCTACA 

TCTGTAAAGTGCTGAGTTTTATGGAGAGAGGCCTTTTTATGCATTAAATTGTACATGGCAAA 

TAAATCCCAGAAGGATCTGTAGATGAGGCACCTGCTTTTTCTTTTCTCTCATTGTCCACCTT 

ACTAAAAGTCAGTAGAATCTTCTACCTCATAACTTCCTTCCAAAGGCAGCTCAGAAGATTAG 

AACCAGACTTACTAACCAATTCCACCCCCCACCAACCCCCTTCTACTGCCTACTTTAAAAAA 

ATTAATAGTTTTCTATGGAACTGATCTAAGATTAGAAAAATTAATTTTCTTTAATTTCATTA 

TGGACTTTTATTTACATGACTCTAAGACTATAAGAAAATCTGATGGCAGTGACAAAGTGCTA 

GCATTTATTGTTATCTAATAAAGACCTTGGAGCATATGTGCAACTTATGAGTGTATCAGTTG 

TTGCATGTAATTTTTGCCTTTGTTTAAGCCTGGAACTTGTAAGAAAATGAAAATTTAATTTT 

TTTTTCTAGGACGAGCTATAGAAAAGCTATTGAGAGTATCTAGTTAATCAGTGCAGTAGTTG 

GAAACCTTGCTGGTGTATGTGATGTGCTTCTGTGCTTTTGAATGACTTTATCATCTAGTCTT 

TGTCTATTTTTCCTTTGATGTTCAAGTCCTAGTCTATAGGATTGGCAGTTTAAATGCTTTAC 

TCCCCCTTTTAAAATAAATGATTAAAATGTGCTTTGAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 38 

MRPGLSFLLALLFFLGQAAGDLGDVGPPIPSPGFSSFPGVDSSSSFSSSSRSGSSSSRSLGS 

GGSVSQLFSNFTGSVDDRGTCQCSVSLPDTTFPVDRVERLEFTAHVLSQKFEKELSKVREYV 

QL I S VYEKKLLNLTVR I DIMEKDT I S YTELDFEL I K VE VKEMEKLV IQLKES FGGS SE I VDQ 

LEVEIRNMTLLVEKLETLDKNNVLAIRREIVALKTKLKECEASKDQNTPVVHPPPTPGSCGH 

GGWNI SKPS WQLNWRGFS YL YGAWGRDYS PQH PNKGLYWVAPLNTDGRLLE YYRLYNTLD 

DLLLYINARELRITYGQGSGTAVYNNl^YVNMYNTGNIARVNLTTNTIAVTQTLPNA^ 

FS YANVAWQD I D F AVDENGLWV I Y STE ASTGNMV I S KLNDTTLQ VLNTWYTKQ YKP S ASNAF 

MVCGVLYATRTMNTRTEE I FYYYDTNTGKEGKLDI VMHKMQEKVQS INYNPFDQKLYVYNDG 

YLLNYDLSVLQKPQ 
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FIGURE 39 

GCTCTGAAGACCAAGCTGAAAGAGTGTGAGGCCTCTAAAGATCAAACACCCCTGTCGTCCAC 
CCTCCTCCCACTCCAGGGAGCTGTGGTCATGGTGGTGTGGTGAACATCAGCAAACCGTCTGT 
GGTTCAGCTCAACTGGAGAGGGTTTTCTTATCTATATGGTGCTTGGGGTAGGGATTACTCTC 
CCCAGCATCCAAACAAAGGNATGTATTGGGNGGCGCCATTGAATACAGATGGGAGACTGTTG 
GAGTATTATAGACTGTACAACCCACTGGATGATTTGCTATTGTATATAAATGCTCGAGAGTT 
GCGGATCACCTATGGCCAAGGTAGTGGTACAGCAGTTTACAACAACAACATGTACGTCAACA 
TGTACAACACCGGGNATATTGCCAGAGTTAACCTGACC 
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FIGURE 40 

TCTCGCAGATAGTAAATAATCTCGGAAAGGCGAGAAAGAAGCTGTCTCCATCTTGTCTGTAT 
CCGCTGCTCTTGTGACGTTGTGGAGATGGGGAGCGTCCTGGGGCTGTGCTCCATGGCGAGCT 
GGATACCATGTTTGTGTGGAAGTGCCCCGTGTTTGCTATGCCGATGCTGTCCTAGTGGAAAC 
AACTCCACTGTAACTAGATTGATCTATGCACTTTTCTTGCTTGTTGGAGTATGTGTAGCTTG 
TGTAATGTTGATACCAGGAATGGAAGAACAACTGAATAAGATTCCTGGATTTTGTGAGAATG 
AGAAAGGTGTTGTCCCTTGTAACATTTTGGTTGGCTATAAAGCTGTATATCGTTTGTGCTTT 
GGTTTGGCTATGTTCTATCTTCTTCTCTCTTTACTAATGATCAAAGTGT^AGAGTAGCAGTGA 
TCCTAGAGCTGCAGTGCACAATGGATTTTGGTTCTTTAAATTTGCTGCAGCAATTGCAATTA 
TTATTGGGGCATTCTTCATTCCAGAAGGAACTTTTACAACTGTGTGGTTTTATGTAGGCATG 
GCAGGTGCCTTTTGTTTCATCCTCATACAACTAGTCTTACTTATTGATTTTGCACATTCATG 
GAATGAATCGTGGGTTGAAAAAATGGAAGAAGGGAACTCGAGATGTTGGTATGCAGCCTTGT 
TATCAGCTACAGCTCTGAATTATCTGCTGTCTTTAGTTGCTATCGTCCTGTTCTTTGTCTAC 
TACACTCATCCAGCCAGTTGTTCAGAAAACAAGGCGTTCATCAGTGTCAACATGCTCCTCTG 
CGTTGGTGCTTCTGTAATGTCTATACTGCCAAAAATCCAAGAATCACAACCAAGATCTGGTT 
TGTTACAGTCTTCAGTAATTACAGTCTACACAATGTATTTGACATGGTCAGCTATGACCAAT 
GAACCAGAAACAAATTGCAACCCAAGTCTACTAAGCATAATTGGCTACAATACAACAAGCAC 
TGTCCCAAAGGAAGGGCAGTCAGTCCAGTGGTGGCATGCTCAAGGAATTATAGGACTAATTC 
TCTTTTTGTTGTGTGTATTTTATTCCAGCATCCGTACTTCAAACAATAGTCAGGTTAATAAA 
CTGACTCTAACAAGTGATGAATCTACATTAATAGAAGATGGTGGAGCTAGAAGTGATGGATC 
ACTGGAGGATGGGGACGATGTTCACCGAGCTGTAGATAATGAAAGGGATGGTGTCACTTACA 
GTTATTCCTTCTTTCACTTCATGCTTTTCCTGGCTTCACTTTATATCATGATGACCCTTACC 
AACTGGTCCAGGTATGAACCCTCTCGTGAGATGAAAAGTCAGTGGACAGCTGTCTGGGTGAA 
AATCTCTTCGAGTTGGATTGGCATCGTGCTGTATGTTTGGACACTCGTGGCACCACTTGTTC 
TTACAAATCGTGATTTTGACTGAGTGAGACTTCTAGCATGAAAGTCCCACTTTGATTATTGC 
TTATTTGAAAACAGTATTCCCAACTTTTGTAAAGTTGTGTATGTTTTTGCTTCCCATGTAAC 
TTCTCCAGTGTTCTGGCATGAATTAGATTTTACTGCTTGTCATTTTGTTATTTTCTTACCAA 
GTGCATTGATATGTGAAGTAGAATGAATTGCAGAGGAAAGTTTTATGAATATGGTGATGAGT 
TAGTAAAAGTGGCCATTATTGGGCTTATTCTCTGCTCTATAGTTGTGAAATGAAGAGTAAAA 
ACAAATTTGTTTGACTATTTTAAAATTATATTAGACCTTAAGCTGTTTTAGCAAGCATTAAA 
GCAAATGTATGGCTGCCTTTTGAAATATTTGATGTGTTGCCTGGCAGGATACTGCAAAGAAC 
ATGGTTTATTTTAAAATTTATAAACAAGTCACTTAAATGCCAGTTGTCTGAAAAATCTTATA 
AGX3TTTTACCCTTGATACGKSAATTTA 

TGGATGGAGGTGTCGGTACTAAATTGAATAACGAGTAAATAATCTTACTTGGGTAGAGATGG 
CCTTTGCCAACAAAGTGAACTGTTTTGGTTGTTTTAAACTCATGAAGTATGGGTTCAGTGGA 
AATGTTTGGAACTCTGAAGGATTTAGACAAGGTTTTGAAAAGGATAATCATGGGTTAGAAGG 
AAGTGTTTTGAAAGTCACTTTGAAAGTTAGTTTTGGGCCCAGCACGGTAGCTCACCCTTGGT 
AATCCCAGCACTTTGGGAGCTTAAGTGGGTAGATTACTTGAGCCCAGGAATTCAGACCAGCT 
TGGCACATGGTGAACCTGTTCTATAAAAATAATCTGGCTTTGAGCATATGCCTGTGGTCCAG 
CACTGAGAGGCTAGTGAAGATTGCTGAGCCCAGAGCCAAAGGTTGCAGTGAGCAAGTCACGT 
CACTGCACTCTAGCTGGCACAGAGTAAGCCAAAAAAATATATATATATTGAAATCAAGGAGG 
CAAAATTTTGACAGGGAAGGAAGTAACTGCAAAACCACTAGGCTTTAGTAGGTACTTATATA 
AAATCTAGTCCAGTTCTCTCATTTAAAAAAATGAAGACACTGAAATACAGACTTAAATAGCT 
CAGATAGCTAATTAGGAAATTTCAAGTTGGCCAATAATAGCATTCTCTCTGACATTTAAAAA 
TAATTTCTATTCAAAATACATGCATATTGATTTACACCTCATACTGTGATAATTAATGTGAT 
GTGGATTGCTGGTGTCCAGCATGACCCATAAACAGGTCAGAAGAATGATGGAATGTTTTAGA 
ATAAACTCCTGCTTATAGTATACTACACAGTTCAAAAGATGTTTAAAATGCTTTTGTATTTA 
CTGCCATGTAATTGAAATATATAGATTATTGTAACCTTTCAACCTGAAAATCAAGCAGTATG 
AGAGTTTAGTTATTTGTATGTGTCACTAGTGTCTAATGAAGCTTTTAAAATCTACAATTTCT 
TCTTTAAAAATATTTATTAATGTGAATGGAATATAACAATTCAGCTTAATTCCCCAACCTTA 
TTCTGTGTGTAGACATTGTATTCCACAATTTTGAATGGCTGTGTTTTACCTCTAAATAAATG 
AATTCAGAGAAAAAAAAAAAAAAA 
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FIGURE 41 

MGS VLGLCSMASWI PCLCGSAPCLLCRCCPSGNNSTVTRLI YALFLLVGVCVACVMLI PGME 
EQLNKIPGFCENEKGWPCNILVGYKAVYRLCFGLAMFYLLLSLLMIKVKSSSDPRAAW 
FWFFKFAAAI AI I IGAFF I PEGTFTTVWFYVGMAGAFCF I LI QLVLL I DFAHSWNESWVEKM 
EEGNSRCWYAALLSATALNYLLSLVAIVLFF 

LPKIQESQPRSGLLQSSVITVYTMYLTWSAMTNEPETNCNPSLLSIIGYNTTSTVPKEGQSV 
QWWHAQGI IGLILFLLCVFYSS IRTSNNSQVNKLTLTSDESTLIEDGGARSDGSLEDGDDVH 
RAVDITCRDGVTYSYSFFHFMLFLASLYIMMTLTNWSRY 
VLYVWTLVAPLVLTNRDFD 
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FIGURE 42 

GCGAGAAAGAAGCTGTCTCCATCTTGTCTGTATCCCGCTGCTTCTTGNGACGTTGTGGAGAT 
GGGGAGCGTCCCTGGGGCTGTGCTC.CATGGCGAGCTGGATACCATGTTTGTGTGGAAGTGCC 
CCGTGTTTGCTATGCCGATGCTGTCCTAGTGGAAACAANTCCACTGTAACTAGATTGATCTA 
TGCACTTTTCTTGCTTGTTGGAGTATGTGTAGCTTGTGTAATGTTGATACCAGGAATGGAAG 
AACAACTGAATAAGATTCCTGGATTTTGTGAGAATGAGAAAGGTGTTGTCCCTTGTAACATT 
TTGGTTGGCTATAAAGCTGTATATCGTTTGTGCTTTGGTTTGGCTATGTTCTATCTTCTTCT 
CTCTTTACTAATGATCAAAGTGAAGAGTAGCAGTGATCCTAGAGCTGCAGTGCACAATGGAT 
TTTGGTTCTTTAAATTTGCTGCAGCAATTGCAATTATTATTGGGGC 
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FIGURE 43 



GTTATTGTGAACTTTGTGGAGATGGGAGGTCNTGGGGCTGTGTTCCATGGCGAGCTGGATAC 
CANGTTTGTGTGGAAGTGCCCCGTGTTTGNTATGCCGATGCTGTCCTAGTGGAAACAANTCC 
ACTGTAATTAGATTGATNTATGCACTTTTNTTGCTTGTTGGAGTANGTGTAGCTTGTGTAAT 
GTTGATACCAGGAATGGAAGAACAACTGAATAAGATTCCTGGATTTTGTGAGAATGAGAAAG 
GTGTTGTCCCTTGTAACATTTTGGTTGGCTATAAAGCTGTATATNGTTTGTGCTTTGGTTTG 
GCTANGTTCTATNTTCTTCTCTCTTTACTAATGATCAAAGTGAAGAGTAGCAGTGATCCTAG 
AGCTGCAGTGCACAATGGATTTTGGTTTTTTAAATTTGCTGCAGCAATTGCAATTATTATTG 
GGGC 
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FIGURE 44 

AAGAAGCTGTCTCCATCTTGTCTGTATCCGCTGCTCTTGTGAACGTTNTGGAGATGGGGAGC 
GTCCTTGGGGTTGTGCTCCATGGCGAGCTGGATACCATGTTTGTGTGGAAGTGCCCCGTGTT 
TGCTATGCCGATGCTGTCCTAGTGGAAACAACTCCACTGTAACTAGATTGATCTATGCACTT 
TTCTTGCTTGTTGGAGTATGTGTAGCTTGTGTAATGTTGATACCAGGAATGGAAGAACAACT 
GAATAAGATTCCTGGATTTTGTGAGAATGAGAAAGGTGTTGTCCCTTGTAACATTTTGGTTG 
GCTATAAAGCTGTATATCGTTTGTGCTTTGGTTTGGCTATGTTCTATCTTCTTCTCTCTTTA 
CTAATGATCAAAGTGAAGAGTAGCAGTGATCCTAGAGCTGCAGTGCACAATGGATTTTGGTT 
CTTTAAATTTGCTGCAGCAATTGCAATTATTATTGGGGC 
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FIGURE 45 

GCTGTCCTTAGTGGAAACAANTCCAACTTGTAACTTGGATTGATCTATGCACTTTTTCCTTG 
CTTGTTGGAGTATGTGTAGCTTTGTGTAATGTTGTTCCCAGGATTGGANGAACAACTGAATA 
AGATTCCTGGATTTTTGTGAGAATGAGAAAGGTGTTGTCCCCTTGTAACATTTTTGGTTGGC 
TATAAAGCTGTATATCGTTTGTGCTTTGGTTTGGCTATGTTCTATCTTCTTCTCTCTTTACT 
AATGATCAAAGTGAAGAGTAGCAGTGATCCTAGAGCTGCAGTGCACAATGGATTTTGGTTCT 
TTAAATTTGCTGCAGCAATTGCAATTATTATTGGGGCATTCTTCATTCCAGAAGGAACTTTT 
ACAACTGTGTGGTTTTATGTAGGCATGGCAGGTGCCTTTTGTTTCATCCTCATACAACTAGT 
CTTACTTATTGATTTTGCACATTCATGGAATGAATCGTGGGTTGAAAAAATGGAAGAAGGGA 
ACTCGAGATGTTGGTATGCAGCCTTGTTATCAGCTACAGCTCTGAATTATCTGCTGTCTTTA 
GTTGCTATCGTCCTGTTCTTTGTCTACTACACTCATCCAGCCAGTTGTTCAGAAAACAAGGC 
GTTCATCAGTGTCAACATGCTCCTCTGCGTTGGTGCTTCTGTAATG 



WO 99/63088 



PCT/US99/12252 



FIGURE 46A 

CTCGGGCGCGCACAGGCAGCTCGGTTTGCCCTGCGATTGAGCTGCGGGTCGCGGCCGGCGCC 
GGCCTCTCCAATGGCAAATGTGTGTGGCTGGAGGCGAGCGCGAGGCTTTCGGCAAAGGCAGT 
CGAGTGTTTGCAGACCGGGGCGAGTCCTGTGAAAGCAGATAAAAGAAAACATTTATTAACGT 
GTCATTACGAGGGGAGCGCCCGGCCGGGGCTGTCGCACTCCCCGCGGAACATTTGGCTCCCT 
CCAGCTCCGAGAGAGGAGAAGAAGAAAGCGGAAAAGAGGCAGATTCACGTCGTTTCCAGCCA 
AGTGGACCTGATCGATGGCCCTCCTGAATTTATCACGATATTTGATTTATTAGCGATGCCCC 
CTGGTTTGTGTGTTACGCACACACACGTGCACACAAGGCTCTGGCTCGCTTCCCTCCCTCGT 
TTCCAGCTCCTGGGCGAATCCCACATCTGTTTCAACTCTCCGCCGAGGGCGAGCAGGAGCGA 
GAGTGTGTCGAATCTGCGAGTGAAGAGGGACGAGGGAAAAGAAACAAAGCCACAGACGCAAC 
TTGAGACTCCCGCATCCCAAAAGAAGCACCAGATCAGCAAAAAAAGAAGATOGGCCCCCCGA 
GCCTCGTGCTGTGCTTGCTGTCCGCAACTGTGTTCTCCCTGCTGGGTGGAAGCTCGGCCTTC 
CTGTCGCACCACCGCCTGAAAGGCAGGTTTCAGAGGGACCGCAGGAACATCCGCCCCAACAT 
CATCCTGGTGCTGACGGACGACCAGGATGTGGAGCTGGGTTCCATGCAGGTGATGAACAAGA 
CCCGGCGCATCATGGAGCAGGGCGGGGCGCACTTCATCAACGCCTTCGTGACCACACCCATG 
TGCTGCCCCTCACGCTCCTCCATCCTCACTGGCAAGTACGTCCACAACCACAACACCTACAC 
CAACAATGAGAACTGCTCCTCGCCCTCCTGGCAGGCACAGCACGAGAGCCGCACCTTTGCCG 
TGTACCTCAATAGCACTGGCTACCGGACAGCTTTCTTCGGGAAGTATCTTAATGAATACAAC 
GGCTCCTACGTGCCACCCGGCTGGAAGGAGTGGGTCGGACTCCTTAAAAACTCCCGCTTTTA 
TAACTACACGCTGTGTCGGAACGGGGTGAAAGAGAAGCACGGCTCCGACTACTCCAAGGATT 
ACCTCACAGACCTCATCACCAATGACAGCGTGAGCTTCTTCCGCACGTCCAAGAAGATGTAC 
CCGCACAGGCCAGTCCTCATGGTCATCAGCCATGCAGCCCCCCACGGCCCTGAGGATTCAGC 
CCCACAATATTCACGCCTCTTCCCAAACGCATCTCAGCACATCACGCCGAGCTACAACTACG 
CGCCCAACCCGGACAAACACTGGATCATGCGCTACACGGGGCCCATGAAGCCCATCCACATG 
GAATTCACCAACATGCTCCAGCGGAAGCGCTTGCAGACCCTCATGTCGGTGGACGACTCCAT 
GGAGACGATTTACAACATGCTGGTTGAGACGGGCGAGCTGGACAACACGTACATCGTATACA 
CCGCCGACCACGGTTACCACATCGGCCAGTTTGGCCTGGTGAAAGGGAAATCCATGCCATAT 
GAGTTTGACATCAGGGTCCCGTTCTACGTGAGGGGCCCCAACGTGGAAGCCGGCTGTCTGAA 
TCCCCACATCGTCCTCAACATTGACCTGGCCCCCACCATCCTGGACATTGCAGGCCTGGACA 
TACCTGCGGATATGGACGGGAAATCCATCCTCAAGCTGCTGGACACGGAGCGGCCGGTGAAT 
CGGTTTCACTTGAAAAAGAAGATGAGGGTCTGGCGGGACTCCTTCTTGGTGGAGAGAGGCAA 
GCTGCTACACAAGAGAGACAATGACAAGGTGGACGCCCAGGAGGAGAACTTTCTGCCCAAGT 
ACCAGCGTGTGAAGGACCTGTGTCAGCGTGCTGAGTACCAGACGGCGTGTGAGCAGCTGGGA 
CAGAAGTGGCAGTGTGTGGAGGACGCCACGGGGAAGCTGAAGCTGCATAAGTGCAAGGGCCC 
CATGCGGCTGGGCGGCAGCAGAGCCCTCTCC^CCTCGTGCCCTVAGTACTACGGGCAGGGCA 
GCGAGGCCTGCACCTGTGACAGCGGGGACTACAAGCTCAGCCTGGCCGGACGCCGGAAAAAA 
CTCTTCAAGAAGAAGTACAAGGCCAGCTATGTGCGCAGTCGCTCCATCCGCTCAGTGGCCAT 
CGAGGTGGACGGCAGGGTGTACCACGTAGGCCTGGGTGATGCCGCCCAGCCCCGAAACCTCA 
CCAAGCGGCACTGGCCAGGGGCCCCTGAGGACCAAGATGACAAGGATGGTGGGGACTTCAGT 
GGCACTGGAGGCCTTCCCGACTACTCAGCCGCCAACCCCATTAAAGTGACACATCGGTGCTA 
CATCCTAGAGAACGACACAGTCCAGTGTGACCTGGACCTGTACAAGTCCCTGCAGGCCTGGA 
AAGACCACAAGCTGCACATCGACCACGAGATTGAAACCCTGCAGAACAAAATTAAGAACCTG 
AGGGAAGTCCGAGGTCACCTGAAGAAAAAGCGGCCAGAAGAATGTGACTGTCACAAAATCAG 
CTACCACACCCAGCACAAAGGCCGCCTCAAGCACAGAGGCTCCAGTCTGCATCCTTTCAGGA 
AGGGCCTGCAAGAGAAGGACAAGGTGTGGCTGTTGCGGGAGCAGAAGCGCAAGAAGAAACTC 
CGCAAGCTGCTCAAGCGCCTGCAGAACAACGACACGTGCAGCATGCCAGGCCTCACGTGCTT 
CACCCACGACAACCAGCACTGGCAGACGGCGCCTTTCTGGACACTGGGGCCTTTCTGTGCCT 
GCACCAGCGCCAACAATAACACGTACTGGTGCATGAGGACCATCAATGAGACTCACAATTTC 
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FTGIJRE 4fiB 

CTCTTCTGTGAATTTGCAACTGGCTTCCTAGAGTACTTTGATCTCAACACAGACCCCTACCA 
GCTGATGAATGCAGTGAACACACTGGACAGGGATGTCCTCAACCAGCTACACGTACAGCTCA 
TGGAGCTGAGGAGCTGCAAGGGTTACAAGCAGTGTAACCCCCGGACTCGAAACATGGACCTG 
GATGGAGGAAGCTATGAGCAATACAGGCAGTTTCAGCGTCGAAAGTGGCCAGAAATGAAGAG 
ACCTTCTTCCAAATCACTGGGACAACTGTGGGAAGGCTGGGAAGGTTAAGAAACAACAGAGG 
TGGACCTCCAAAAACATAGAGGCATCACCTGACTGCACAGGCAATGAAAAACCATGTGGGTG 
ATTTCCAGCAGACCTGTGCTATTGGCCAGGAGGCCTGAGAAAGCAAGCACGCACTCTCAGTC 
AACATGACAGATTCTGGAGGATAACCAGCAGGAGCAGAGATAACTTCAGGAAGTCCATTTTT 
GCCCCTGCTTTTGCTTTGGATTATACCTCACCAGCTGCACAAAATGCATTTTTTCGTATCAA 
AAAGTCACCACTAACCCTCCCCCAGAAGCTCACAAAGGAAAACGGAGAGAGCGAGCGAGAGA 
GATTTCCTTGGAAATTTCTCCCAAGGGCGAAAGTCATTGGAATTTTTAAATCATAGGGGAAA 
AGCAGTCCTGTTCTAAATCCTCTTATTCTTTTGGTTTGTCACAAAGAAGGAACTAAGAAGCA 
GGACAGAGGCAACGTGGAGAGGCTGAAAACAGTGCAGAGACGTTTGACAATGAGTCAGTAGC 
ACAAAAGAGATGACATTTACCTAGCACTATAAACCCTGGTTGCCTCTGAAGAAACTGCCTTC 
ATTGTATATATGTGACTATTTACATGTAATCAACATGGGAACTTTTAGGGGAACCTAATAAG 
AAATCCCAATTTTCAGGAGTGGTGGTGTCAATAAACGCTCTGTGGCCAGTGTAAAAGAAAAA 
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FIGURE 47 

MGPPSLVLCLLSATVFSLLGGSSAFLSHHRLKGRFQRDRRNIRPNIILVLTDDQDVELGSMQ 
Vl^KTRRIMEQGGAHFINAFVTTPMCCPSRSSILTGKYVHNHNTYTNNENCSSPSWQAQHES 
RTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCRNGVKEKHGSD 
YSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYSRLFPNASQHITP 
SYNYAPNPDKHWIMRYTGPMKPIHMEFTIMLQRKRLQTLMSVDDSMETIYNMLVETGELDNT 
YIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGCLNPHIVLNIDLAPTILDI 
AGLD I PADMDGKS I LKLLDTERPVNRFHLKKKMRVWRDSFL VERGKLLHKRDNDKVDAQEEN 
FLPKYQRVKDLCQRAEYQTACEQLGQKWQCVEDATGKLKLHKCKGPMRLGGSRALSNLVPKY 
YGQGSEACTCDSGDYKLSLAGRRKKLFKKKYKAS YVRSRS IRS VAI EVDGRVYHVGLGDAAQ 
PRNLTKRHWPGAPEDQDDKDGGDFSGTGGLPDYSAANPIKVTHRCYILENDTVQCDLDLYKS 
LQAWKDHKLHIDHEIETLQNKIKNLREVRGHLKKKRPEECDCHKISYHTQHKGRLKHRGSSL 
HPFRKGLQEKDKVWLLREQKRKKKLRKLLKRLQNNDTCSMPGLTCFTHDNQHWQTAPFWTLG 
PFCACTSANNNTYWC^RTINETHNFLFCEFATGFLEYFDLNTDPYQLMNAVNTLDRDVLNQL 
HVQLMELRSCKGYKQCNPRTRNMDLDGGSYEQYRQFQRRKWPEMKRPSSKSLGQLWEGWEG 
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FIGURE 48 

AACAAAGTTCAGTGACTGAGAGGGCTGAGCGGAGGCTGCTGAAGGGGAGAAAGGAGTGAGGA 
GCTGCTGGGCAGAGAGGGACTGTCCGGCTCCCAGATGCTGGGCCTCCTGGGGAGCACAGCCC 
TCGTGGGATGGATCACAGGTGCTGCTGTGGCGGTCCTGCTGCTGCTGCTGCTGCTGGCCACC 
TGCCTTTTCCACGGACGGCAGGACTGTGACGTGGAGAGGAACCGTACAGCTGCAGGGGGAAA 
CCGAGTCCGCCGGGCCCAGCCTTGGCCCTTCCGGCGGCGGGGCCACCTGGGAATCTTTCACC 
ATCACCGTCATCCTGGCCACGTATCTCATGTGCCGAATGTGGGCCTCCACCACCACCACCAC 
CCCCGCCACACCCCTCACCACCTCCACCACCACCACCACCCCCACCGCCACCATCCCCGCCA 
CGCTCGCTGAGGCTGCTGTCGCCGGTGCCTGTGGACAGCAGCTGCCCCTGCCCTCCCATCTG 
TTCCCAGGACAAGTGGACCCCATGTTTCCATGTGGAAGGATGCATCTCTGGGGTGAACGAGG 
GGAACAATAGACTGGGGCTTGCTCCAGCTGCATTTGCATGGCATGCCCCAGTGTACTATGGC 
AGCAGAGAATGGAGGAACACTGGGTCTGCAGTGCTGAAGGGTTTGGGGAGTGGAGAGCAAGG 
GTGCTCTTTCGGGGCTGGACAGCCCGTCTTGTGACAGTGACTCCCAGTGAGCCCCAGAAATG 
ACAAGCGTGTCTTGGCAGAGCCAGCACACAAGTGGATGTGAAGTGCCCGTCTTGACCTCCTC 
ATCAGGCTGCTGCAGGCCTCTGGCGGGCAGGGCACTGGGAGAGGCCCTGAGAATGTCCTTTT 
GGTTTGGAGAAGGCAGTGTGAGGCTGCACAGTCAATTCATCGGTGCCTTAGTCCAAGAAAAT 
AAAAACCACTAAGAAGCTTTAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 49 

MLGLLGSTALVGWITGAAVAVLLLLLLLATCLFHGRQDCDVERNRTAAGGNRVRRAQPWPFR 
RRGHLGIFHHHRHPGHVSHVPNVGLHHHHHPRHTPHHLHHHHHPHRHHPRHAR 
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FIGURE 50 

GGCGGCTGCTGAGCTGCCTTGAGGTGCAGTGTTGGGGATCCAGAGCCATGTCGGACCTGCTA 

CTACTGGGCCTGATTGGGGGCCTGACTCTCTTACTGCTGCTGACGCTGCTGGCCTTTGCCGG 

GTACTCAGGGCTACTGGCTGGGGTGGAAGTGAGTGCTGGGTCACCCCCCATCCGCAACGTCA 

CTGTGGCCTACAAGTTCCACATGGGGCTCTATGGTGAGACTGGGCGGCTTTTCACTGAGAGC 

TGCAGCATCTCTCCCAAGCTCCGCTCCATCGCTGTCTACTATGACAACCCCCACATGGTGCC 

CCCTGATAAGTGCCGATGTGCCGTGGGCAGCATCCTGAGTGAAGGTGAGGAATCGCCCTCCC 

CTGAGCTCATCGACCTCTACCAGAAATTTGGCTTCAAGGTGTTCTCCTTCCCGGCACCCAGC 

CATGTGGTGACAGCCACCTTCCCCTACACCACCATTCTGTCCATCTGGCTGGCTACCCGCCG 

TGTCCATCCTGCCTTGGACACCTACATCAAGGAGCGGAAGCTGTGTGCCTATCCTCGGCTGG 

AGATCTACCAGGAAGACCAGATCCATTTCATGTGCCCACTGGCACGGCAGGGAGACTTCTAT 

GTGCCTGAGATGAAGGAGACAGAGTGGAAATGGCGGGGGCTTGTGGAGGCCATTGACACCCA 

GGTGGATGGCACAGGAGCTGACACAATGAGTGACACGAGTTCTGTAAGCTTGGAAGTGAGCC 

CTGGCAGCCGGGAGACTTCAGCTGCCACACTGTCACCTGGGGCGAGCAGCCGTGGCTGGGAT 

GACGGTGACACCCGCAGCGAGCACAGCTACAGCGAGTCAGGTGCCAGCGGCTCCTCTTTTGA 

GGAGCTGGACTTGGAGGGCGAGGGGCCCTTAGGGGAGTCACGGCTGGACCCTGGGACTGAGC 

CCCTGGGGACTACCAAGTGGCTCTGGGAGCCCACTGCCCCTGAGAAGGGCAAGGAG TAA CCC 

ATGGCCTGCACCCTCCTGCAGTGCAGTTGCTGAGGAACTGAGCAGACTCTCCAGCAGACTCT 

CCAGCCCTCTTCCTCCTTCCTCTGGGGGAGGAGGGGTTCCTGAGGGACCTGACTTCCCCTGC 

TCCAGGCCTCTTGCTAAGCCTTCTCCTCACTGCCCTTTAGGCTCCCAGGGCCAGAGGAGCCA 

GGGACTATTTTCTGCACCAGCCCCCAGGGCTGCCGCCCCTGTTGTGTCTTTTTTTCAGACTC 

ACAGTGGAGCTTCCAGGACCCAGAATAAAGCCAATGATTTACTTGTTTCACCTGGAAAAAAA 

AAAAAAAAAA 
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FIGURE 51 

MSDLLLLGLIGGLTLLLLLTLLAFAGYSGLLAGVEVSAGSPPIRNVTVAYKFHMGLYGETGR 
LFTESCSISPKLRSIAVYYDNPHMVPPDKCRCAVGSILSEGEESPSPELIDLYQKFGFKVFS 
FPAPSHWTATFPYTTILSIWLATRRVHPALDTYIKERKLCAYPRLEIYQEDQIHFMCPLAR 
QGDFYVPEMKETEWKWRGLVEAIDTQVDGTGADTMSDTSSVSLEVSPGSRETSAATLSPGAS 
SRGWDDGDTRSEHSYSESGASGSSFEEIJDLEGEGPLGESRLDPGTEPLGTTKWLWEPTAPEKGKE 
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CCGCGGGAACGCTGTCCTGGCTGCCGCCACCCGAACAGCCTGTCCTGGTGCCCCGGCTCCCT 
GCCCCGCGCCCAGTCATGACCCTGCGCCCCTCACTCCTCCCGCTCCATCTGCTGCTGCTGCT 
GCTGCTCAGTGCGGCGGTGTGCCGGGCTGAGGCTGGGCTCGAAACCGAAAGTCCCGTCCGGA 
CCCTCCAAGTGGAGACCCTGGTGGAGCCCCCAGAACCATGTGCCGAGCCCGCTGCTTTTGGA 
GACACGCTTCACATACACTACACGGGAAGCTTGGTAGATGGACGTATTATTGACACCTCCCT 
GACCAGAGACCCTCTGGTTATAGAACTTGGCCAAAAGCAGGTGATTCCAGGTCTGGAGCAGA 
GTCTTCTCGACATGTGTGTGGGAGAGAAGCGAAGGGCAATCATTCCTTCTCACTTGGCCTAT 
GGAAAACGGGGATTTCCACCATCTGTCCCAGCGGATGCAGTGGTGCAGTATGACGTGGAGCT 
GATTGCACTAATCCGAGCCAACTACTGGCTAAAGCTGGTGAAGGGCATTTTGCCTCTGGTAG 
GGATGGCCATGGTGCCAGCCCTCCTGGGCCTCATTGGGTATCACCTATACAGAAAGGCCAAT 
AGACCCAAAGTCTCCAAAAAGAAGCTCAAGGAAGAGAAACGAAACAAGAGCAAAAAGAAATA 
ATAAATAATAAATTTTAAAAAACTTAAAAAAAAAAAAAAAAAA 
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FIGURE 53 

MTLRPSLLPLHLLLLLLLSAAVCRAEAGLETESPVRTLQVETLVEPPEPCAEPAAFGDTLHI 
HYTGSLVDGRI IDTSLTRDPLVIELGQKQVIPGLEQSLLDMCVGEKRRAI IPSHLAYGKRGF 
PPS VPADAWQYDVEL I AL I RANYWLKLVKG I LP LVGMAMVP ALLGL I GYHLYRKANRPKVS 
KKKLKEEKRNKSKKK 
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FIGURE 54 

CCCGGGAACGTGTTCCTGGCTGCCGCACCCGAACAGCCTGTCCTGGTGCCCCGGCTCCCTGC 
CCCGCGCCCAGTCATGACCCTGCGCCCCTCACTCCTCCCGCTCCATCTGCTGCTGCTGCTGC 
TGCTCAGTGCGGCGGTGTGCCGGGCTGAGGCTGGGCTCGAAACCGAAAGTCCCGTCCGGACC 
CTCCAAGTGGAGACCCTGGTGGAGCCCCCAGAACCATGTGCCGAGCCCGCTGCTTTTGGAGA 
CACGCTTCACATACACTACACGGGAAGCTTGGTAGATGGACGTATTATTGACACCTCCCTGA 
CCAGAGACCCTCTGGTTATAGAACTTGGCCAAAAGCAGGTGATTCCAGGTCTGGAGCAGAGT 
CTTCTCGACATGTGTGTGGGAGAGAAGCGAAGGGCAATCATTCCTTCTCACTTGGCCTATGG 
AAAACGGGGATTTCCACCATCTGTCCCAGCGGATGCAGTGGTGCAGTATGACGTGGAGCTGA 
TTGCACTAATCCGAGCCAACTACTGGCTAAAGCTGGTGAAGGGCATTTTGCCTCTGGTAGGG 
ATGGCCATGGTGCCACCCTCCTGGGCCTCATTGGGTATCACCTATACAGAAAGGCCAATAGA 
CCCAAAGTCTCCAAAAAGAAGCT(^GGAAGAGAAACGAAACAAGAGCAAAAAGAAATAATA 
AATAATAAATTTTAAAAAACTTA 
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FIGURE 55 

CCGAAAGTCCCGTCCGGACCCTCCAAGTGGAGACCCTGGTGGAGCCCCCAGAACCATGTGCC 
GAGCCCGCTGCTTTTGGAGACACGCTTCACATACACTACACGGGAAGCTTGGTAGATGGACG 
TATTATTGACACCTCCCTGACCAGAGACCCTCTGGTTATAGAACTTGGCCAAAAGCAGGTGA 
TTCCAGGTCTGGAGCAGAGTCTTCTCGACATGTGTGTGGGAGAGAAGCGAAGGGCAATCATT 
CCTTCTCACTTGGCCTATGGAAAACGGGGATTTCCACCATCTGTCCCAGCGGATGCAGTGGT 
GCAGTATGACGTGGAGCTGATTGCACTAATCCGAGCCAACTACTGGCTAAAGCTGGTGAAGG 
GCATTTTGCCTCTGGTAGGGATGGCCATGGTGCCAGCCCTCCTGGGCCTCATTGGGTATCAC 
CTATACAGAAAGGCCAATAGACCCAAAGTCTCCAAAAAGAAGCTCAAGGAAGAGAAACGAAA 
CAAGAGCAAAAAGAAATAATAAATAATAAATTTTAAAAAACTTAAAA 
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FIGURE 56 

CTGCTGCATCCGGGTGTCTGGAGGCTGTGGCCGTTTTGTTTTCTTGGCTAAAATCGGGGGAG 
TGAGGCGGGCCGGCGCGGCGCGACACCGGGCTCCGGAACCACTGCACGACGGGGCTGGACTG 
ACCTGAAAAAAATGTCTGGATTTCTAGAGGGCTTGAGATGCTCAGAATGCATTGACTGGGGG 
GAAAAGCGCAATACTATTGCTTCCATTGCTGCTGGTGTACTATTTTTTACAGGCTGGTGGAT 
TATCATAGATGCAGCTGTTATTTATCCCACCATGAAAGATTTCAACCACTCATACCATGCCT 
GTGGTGTTATAGCAACCATAGCCTTCCTAATGATTAATGCAGTATCGAATGGACAAGTCCGA 
GGTGATAGTTACAGTGAAGGTTGTCTGGGTCAAACAGGTGCTCGCATTTGGCTTTTCGTTGG 
TTTCATGTTGGCCTTTGGATCTCTGATTGCATCTATGTGGATTCTTTTTGGAGGTTATGTTG 
CTAAAGAAAAAGACATAGTATACCCTGGAATTGCTGTATTTTTCCAGAATGCCTTCATCTTT 
TTTGGAGGGCTGGTTTTTAAGTTTGGCCGCACTGAAGACTTATGGCAG TGA ACACATCTGAT 
TTCCCACAGCACAACAGCCCTGCATGGGTTTGTTTGTTTTTTTACTGCTCACTCCCAACCTT 
TTGTAATGCCATTTTCTAAACTTATTTCTGAGTGTAGTCTCAGCTTAAAGTTGTGTAATACT 
AAAATCACGAGAACACCTAAACAACAACCAAAAATCTATTGTGGTATGCACTTGATTAACTT 
ATAAAATGTTAGAGGAAACTTTCACATGAATAATTTTTGTCAAATTTTATCATGGTATAATT 
TGTAAAAATAAAAAGAAATTACAAAAGAAATTATGGATTTGTCAATGTAAGTATTTGTCATA 
TCTGAGGTCCAAAACCACAATGAAAGTGCTCTGAAGATTTAATGTGTTTATTCAAATGTGGT 
CTCTTCTGTGTCAAATGTTAAATGAAATATAAACATTTTTTAGTTTTTAAAATATTCCGTGG 
TCAAAATTCTTCCTCACTATAATTGGTATTTACTTTTACCAAAAATTCTGTGAACATGTAAT 
GTAACTGGCTTTTGAGGGTCTCCCAAGGGGTGAGTGGACGTGTTGGAAGAGAGAAGCACCAT 
GGTCCAGCCACCAGGCTCCCTGTGTCCCTTCCATGGGAAGGTCTTCCGCTGTGCCTCTCATT 
CCAAGGGCAGGAAGATGTGACTCAGCCATGACACGTGGTTCTGGTGGGATGCACAGTCACTC 
CACATCCACCACTG 
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FIGURE 57 

MSGFLEGLRCSECIDWGEKRNTIASIAAGVLFFTGWWI IIDAAVIYPTMKDFNHSYHACGVI 
ATIAFLMINAVSNGQVRGDSYSEGCLGQTGARIWLFVGFMLAFGSLIASMWILFGGYVAKEK 
DIVYPGIAVFFQNAFI FFGGLVFKFGRTEDLWQ 
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FIGURE 58 

TTCTTGGCTAAAATCGGGGGAGTGAGGCGGGCCGGCGCGGCGCGACACCGGGCTCCGGAACC 
ACTGCACGACGGGGCTGGACTGACCTGAAAAAAATGTCTGGATTTCTAGAGGGCTTGAGATG 
CTCAGAATGCATTGACTGGGGGGAAAAGCGCAATACTATTGCTTCCATTGCTGCTGGTGTAC 
TATTTTTTACAGGCTGGTGGATTATCATAGATGCAGCTGTTATTTATCCCACCATGAAAGAT 
TTCAACCACTCATACCATGCCTGTGGTGTTATAGCAACCATAGCCTTCCTAATGATTAATGC 
AGTATCGAATGGACAAGTCCGAGGTGATAGTTACAGTGAAGGTTGTCTGGGTCAAACAGGTG 
CTCGCATTTGGCTTTTCGTTGGTTTCATGTTGGCCTTTGGATCTCTGATTGCATCTATGTGG 
ATTCTTTTTGGAGGTTATGTTGCTAAAGAAAAAGACATAGTATACCCTGGAATTGCTGTATT 
TTTCCAGAATGCCTTCATCTTTTTTGGAGGGCTGGTTTTTAAGTTTGGC 
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FIGURE 59 

TGGACGGACCTGAAAAAAATGTTTGGATTTNTAGAGGGNTTGAGATGTTCAGAATGCATGAC 
TGGGGGAAAAGCGCAAATACTATTGCTTCCATTGCTGCTGGTGTANTATTTTTTACAGGCTG 
GTGGATTATC^TAGATGCAGNTGTTATTTATCCCACCATGAT^GATTTCAACCANTCATACC 
ATGCCTGTGGTGTTATAGCAACCATAGCCTTCNTAATGATTAATGCAGTATCGAATGGACAA 
GTCCGAGGTGATAGTTACAGTGAAGGTTGTTTGGGTCAAACAGGTGCTCGCATTTGGCTTTT 
CGTTGGTTTCATGTTGGCCTTTGGATCTCTGATTGCATCTATGTGGATTCTTTTTGGAGGTT 
ATGTTGCTAAAGAAAT^AGACATAGTATACCCTGGAATTGNTGTATTTTTCCAGAATGCCTTC 
ATCTTTTTTGGAGGGCTGGTTTTTAAGTTTGGCCGCACTGAAGANTTATGGCAGTG 
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GGACACCGGGTTCCGGACCAATGCANGACGGGGTGGANTGACCTGAAAAAAATGTTTGGATT 
TTTAGAGGGCTTGAGATGNTCAGAATGCATTGACTGGGGGAAAAGCGCAATANTATTGCTTT 
CCATTGCTGCTGGTGTACTATTTTTTACAGGGTGGTGGATTATCATAGATGCAGCTGTTATT 
TATCCCACCATGAAAGATTTNAACCACTCATACCATGCCTGTGGTGTTATAGCAACCATAGC 
CTTCCTAATGATTAATGCAGTATCGAATGGACAAGTCCGAGGTGATAGTTACAGTGAAGGTT 
GTTTGGGTCAAACAGGTGNTCGCATTTGGCTTTTCGTTGGTTTCATGTTGGCCTTTGGATTT 
CTGATTGNATTCTATGCGGATTCTTCTTGGAGGTTATGTTGCTAAAGAAAAAGACATAGTAT 
ACCCTGGAATTNCTNTATTTTTCCAGAATGCC 
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FIGURE 61 

TAGAGGGCTTGAGATGCTCAGAATGCATTGACTGGGGGGAAAAGCGCAATANTATTGCTTCC 
ATTGNTGNTGGTGTANTATTTTTTTACAGGCTGGTGGATTATNATAGATGCAGCTGTTATTT 
ATCCCACCATGAAAGATTTNAACCANTCATACCATGCCTGTGGTGTTATAGCAACCATAGCC 
TTCCTAATGATTAATGCAGTATNGAATGGACAAGTCCGAGGTGATAGTTACAGTGAAGGTTG 
TTTGGGTCAAACAGGTGNTNGCATTTGGCTTTTNGTTGGTTTCATGTTGGCCTTTGGATCTN 
TGATTGCATTTATGTGGATTNTTTTTGGAGGTTATGTTGCTAAAGNAAAAGACATAGTATAC 
CCTGT 
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FIGURE 62 



GGGAGGCTGTGNCCGTTTTGTTTTNTTGGCTAAAATCGGGGGAGTGAGGCGGCCCGGCGCGG 
CGNGACACCGGGTTCCGGGAACCATTGCACGACGGGGTGGACTGACCTGAAAAAAATGTTTG 
GATTTNTAGAGGGCTTGAGATGCTCAGAATGCATTGACTGGGGGGAAAAGCGCAATACTATT 
GCTTCCATTGCTGCTGGTGTACTATTTTTTACAGGCTGGTGGATTATCATAGATGCAGCTGT 
TATTTATCCCACCATGAAAGATTTCAACCACTCATACCATGCCTGTGGTGTTATAGCAACCA 
TAGCCTTCCTAATGATTAATGCAGTATCGAATGGACAAGTCCGAGGTGATAGTTACAGTGAA 
GGTTGTCTGGGTCAAACAGGTGCTCGCATTTGGCTTTTCGTTGGTTTCATGTTGGCCTTTGG 
ATNTCTGATTGCATCTATGTGGATTCTTTTTGGAGGTTATGTTGCTAAAGAAAAAGACATAG 
TATACCCTGGAATTGCTGTATTTTTCCAGAATGCCTTCATNTTTTTTGGAGGGCTG 



WO 99/63088 




PCIYUS99/12252 



FIGURE 63 

CGACGCCGGCGTGATGTGGCTTCCGCTGGTGCTGCTCCTGGCTGTGCTGCTGCTGGCCGTCC 
TCTGCAAAGTTTACTTGGGACTATTCTCTGGCAGCTCCCCGAATCCTTTCTCCGAAGATGTC 
AAACGGCCCCCAGCGCCCCTGGTAACTGACAAGGAGGCCAGGAAGAAGGTTCTCAAACAAGC 
TTTTTCAGCCAACCAAGTGCCGGAGAAGCTGGATGTGGTGGTAATTGGCAGTGGCTTTGGGG 
GCCTGGCTGCAGCTGCAATTCTAGCTAAAGCTGGCAAGCGAGTCCTGGTGCTGGAACAACAT 
ACCAAGGCAGGGGGCTGCTGTCATACCTTTGGAAAGAATGGCCTTGAATTTGACACAGGAAT 
CCATTACATTGGGCGTATGGAAGAGGGCAGCATTGGCCGTTTTATCTTGGACCAGATCACTG 
AAGGGCAGCTGGACTGGGCTCCCCTGTCCTCTCCTTTTGACATCATGGTACTGGAAGGGCCC 
AATGGCCGAAAGGAGTACCCCATGTACAGTGGAGAGAAAGCCTACATTCAGGGCCTCAAGGA 
GAAGTTTCCACAGGAGGAAGCTATCATTGACAAGTATATAAAGCTGGTTAAGGTGGTATCCA 
GTGGAGCCCCTCATGCCATCCTGTTGAAATTCCTCCCATTGCCCGTGGTTCAGCTCCTCGAC 
AGGTGTGGGCTGCTGACTCGTTTCTCTCCATTCCTTCAAGCATCCACCCAGAGCCTGGCTGA 
GGTCCTGCAGCAGCTGGGGGCCTCCTCTGAGCTCCAGGCAGTACTCAGCTACATCTTCCCCA 
CTTACGGTGTCACCCCCAACCACAGTGCCTTTTCCATGCACGCCCTGCTGGTCAACCACTAC 
ATGAAAGGAGGCTTTTATCCCCGAGGGGGTTCCAGTGAAATTGCCTTCCACACCATCCCTGT 
GATTCAGCGGGCTGGGGGCGCTGTCCTCACAAAGGCCACTGTGCAGAGTGTGTTGCTGGACT 
CAGCTGGGAAAGCCTGTGGTGTCAGTGTGAAGAAGGGGCATGAGCTGGTGAACATCTATTGC 
CCCATCGTGGTCTCCAACGCAGGACTGTTCAACACCTATGAACACCTACTGCCGGGGAACGC 
CCGCTGCCTGCCAGGTGTGAAGCAGCAACTGGGGACGGTGCGGCCCGGCTTAGGCATGACCT 
CTGTTTTCATCTGCCTGCGAGGCACCAAGGAAGACCTGCATCTGCCGTCCACCAACTACTAT 
GTTTACTATGACACGGACATGGACCAGGCGATGGAGCGCTACGTCTCCATGCCCAGGGAAGA 
GGCTGCGGAACACATCCCTCTTCTCTTCTTCGCTTTCCCATCAGCCAAAGATCCGACCTGGG 
AGGACCGATTCCCAGGCCGGTCCACCATGATCATGCTCATACCCACTGCCTACGAGTGGTTT 
GAGGAGTGGCAGGCGGAGCTGAAGGGAAAGCGGGGCAGTGACTATGAGACCTTCAAAAACTC 
CTTTGTGGAAGCCTCTATGTCAGTGGTCCTGAAACTGTTCCCACAGCTGGAGGGGAAGGTGG 
AGAGTGTGACTGCAGGATCCCCACTCACCAACCAGTTCTATCTGGCTGCTCCCCGAGGTGCC 
TGCTACGGGGCTGACCATGACCTGGGCCGCCTGCACCCTTGTGTGATGGCCTCCTTGAGGGC 
CCAGAGCCCCATCCCCAACCTCTATCTGACAGGCCAGGATATCTTCACCTGTGGACTGGTCG 
GGGCCCTGCAAGGTGCCCTGCTGTGCAGCAGCGCCATCCTGAAGCGGAACTTGTACTCAGAC 
CTTAAGAATCTTGATTCTAGGATCCGGGCACAGAAGAAAAAGAATTAGTTCCATCAGGGAGG 
AGTCAGAGGAATTTGCCCAATGGCTGGGGCATCTCCCTTGACTTACCCATAATGTCTTTCTG 
CATTAGTTCCTTGCACGTATAAAGCACTCTAATTTGGTTCTGATGCCTGAAGAGAGGCCTAG 
TTTAAATCACAATTCCGAATCTGGGGCAATGGAATCACTGCTTCCAGCTGGGGCAGGTGAGA 
TCTTTACGCCTTTTATAACATGCCATCCCTACTAATAGGATATTGACTTGGATAGCTTGATG 
TCTCATGACGAGCGGCGCTCTGCATCCCTCACCCATGCCTCCTAACTCAGTGATCAAAGCGA 
ATATTCCATCTGTGGATAGAACCCCTGGCAGTGTTGTCAGCTCAACCTGGTGGGTTCAGTTC 
TGTCCTGAGGCTTCTGCTCTCATTCATTTAGTGCTACGCTGCACAGTTCTACACTGTCAAGG 
GAAAAGGGAGACTAATGAGGCTTAACTCAAAACCTGGGCGTGGTTTTGGTTGCCATTCCATA 
GGTTTGGAGAGCTCTAGATCTCTTTTGTGCTGGGTTCAGTGGCTCTTCAGGGGACAGGAAAT 
GCCTGTGTCTGGCCAGTGTGGTTCTGGAGCTTTGGGGTAACAGCAGGATCCATCAGTTAGTA 
GGGTGCATGTCAGATGATCATATCCAATTCATATGGAAGTCCCGGGTCTGTCTTCCTTATCA 
TCGGGGTGGCAGCTGGTTCTCAATGTGCCAGCAGGGACTCAGTACCTGAGCCTCAATCAAGC 
CTTATCCACCAAATACACAGGGAAGGGTGATGCAGGGAAGGGTGACATCAGGAGTCAGGGCA 
TGGACTGGTAAGATGAATACTTTGCTGGGCTGAAGCAGGCTGCAGGGCATTCCAGCCAAGGG 
CACAGCAGGGGACAGTGCAGGGAGGTGTGGGGTAAGGGAGGGAAGTCACATCAGAAAAGGGA 
AAGCCACGGAATGTGTGTGAAGCCCAGAAATGGCATTTGCAGTTAATTAGCACATGTGAGGG 
TTAGACAGGTAGGTGAATGCAAGCTCAAGGTTTGGAAAAATGACTTTTCAGTTATGTCTTTG 
GTATCAGACATACGAAAGGTCTCTTTGTAGTTCGTGTTAATGTAACATTAATAAATTTATTG 
ATTCCATTGCTTTAAAAAAAAAAAAAAA 
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FIGURE 64 

MWLPLVLLLAVLLIAVLCKVYLGLFSGSSPNPFSEDVKRPPAPLVTDKEARKKVLKQAFSAN 
QVPEKLDVWIGSGFGGLAAAAILAKAGKRVLVLEQHTKAGGCCHTFGKNGLEFDTGIHYIG 
RMEEGSIGRFILDQITEGQLDWAPLSSPFDIMVLEGPNGRKEYPMYSGEKAYIQGLKEKFPQ 
EEAIIDKYIKLVKWSSGAPHAILLKFLPLPWQLLDRCGLLTRFSPFLQASTQSLAEVLQQ 
LGASSELQAVLSYIFPTYGVTPNHSAFSMHALLVNHYMKGGFYPRGGSSEIAFHTIPVIQRA 
GGAVLTKATVQSVLLDSAGKACGVSVKKGHELVNIYCPIWSNAGLFNTYEHLLPGNARCLP 
GVKQQLGTVRPGLGMTSVFICLRGTKEDLHLPSTNYYVYYDTDMDQAMERYVSMPREEAAEH 
IPLLFFAFPSAKDPTWEDRFPGRSTMIMLIPTAYEWFEEWQAELKGKRGSDYETFKNSFVEA 
SMSVVLKLFPQLEGKVESVTAGSPLTNQFY3LAAPRGACYGADHDLGRLHPCVMASLRAQSPI 
PNLYLTGQDIFTCGLVGALQGALLCSSAILKRNLYSDLKNLDSRIRAQKKKN 
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FIGURE 65 



GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCTA 
GGGGTTGGCACCGGCCCCGAGAGGAGGATGCGGGTCCGGATAGGGCTGACGCTGCTGCTGTG 
TGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGGATGAAGAAGGCAGCCAGGATGAATCCT 
TAGATTCCAAGACTACTTTGACATCAGATGAGTCAGTAAAGGACCATACTACTGCAGGCAGA 
GTAGTTGCTGGTCAAATATTTCTTGATTCAGAAGAATCTGAATTAGAATCCTCTATTCAAGA 
AGAGGAAGACAGCCTCAAGAGCCAAGAGGGGGAAAGTGTCACAGAAGATATCAGCTTTCTAG 
AGTCTCCAAATCCAGAAAACAAGGACTATGAAGAGCCAAAGAAAGTACGGAAACCAGCTTTG 
ACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCTTTTCCTAGATAA 
GGAGTATGATGAATGTACATCAGATGGGAGGGAAGATGGCAGACTGTGGTGTGCTACAACCT 
ATGACTACAAAGCAGATGAAAAGTGGGGCTTTTGTGAAACTGAAGAAGAGGCTGCTAAGAGA 
CGGCAGATGCAGGAAGCAGAAATGATGTATCAAACTGGAATGAAAATCCTTAATGGAAGCAA 
TAAGAAAAGCCAAAAAAGAGAAGCATATCGGTATCTCCAAAAGGCAGCAAGCATGAACCATA 
CCAAAGCCCTGGAGAGAGTGTCATATGCTCTTTTATTTGGTGATTACTTGCCACAGAATATC 
CAGGCAGCGAGAGAGATGTTTGAGAAGCTGACTGAGGAAGGCTCTCCCAAGGGACAGACTGC 
TCTTGGCTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTAT 
ATTATACATTTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTT 
TAGTGGAAGGCTAATAATATTAACATCAGAAGAATTTGTGGTTTATAGCGGCCACAACTTTT 
TCAGCTTTCATGATCCAGATTTGCTTGTATTAAGACCAAATATTCAGTTGAACTTGCTTCAA 
ATTCTTGTTAATGGATATAACACATGGAATCTACATGTAAATGAAAGTTGGTGGAGTCCACA 
ATTTTTCTTTAAAATGATTAGTTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGC 
TCTTTTTAAATTTTCTCTGAGTTGGAATTGTCAGAATCATTTTTTACATTAGATTATCATAA 
TTTTAAAAATTTTTCTTTAGTTTTTCAAAATTTTGTAAATGGTGGCTATAGAAAAACAACAT 
GAAATATTATACAATATTTTGCAACAATGCCCTAAGAATTGTTAAAATTCATGGAGTTATTT 
GTGCAGAATGACTCCAGAGAGCTCTACTTTCTGTTTTTTACTTTTCATGATTGGCTGTCTTC 
CCATTTATTCTGGTCATTTATTGCTAGTGACACTGTGCCTGCTTCCAGTAGTCTCATTTTCC 
CTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGGAAGATTAACTCATTTTTAATAAA 
ATTATGTCTAAGATTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 66 

MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLD 
SEESELESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHG 
EPCHFPFLFLDKEYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQMQEAEMM 
YQTGMKILNGSNKKSQKREAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEK 
LTEEGSPKGQTALGFLYASGLGVNSSQAKALVYYTFGALGGNLIAHMVLVSRL 
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FIGURE 67 

CTTCCCAGCCCTGTGCCCCAAAGCACCTGGAGCATATAGCCTTGCAGAACTTCTACTTGCCT 
GCCTCCCTGCCTCTGGCCATGGCCTGCCGGTGCCTCAGCTTCCTTCTGATGGGGACCTTCCT 
GTCAGTTTCCCAGACAGTCCTGGCCCAGCTGGATGCACTGCTGGTCTTCCCAGGCCAAGTGG 
CTCAACTCTCCTGCACGCTCAGCCCCCAGCACGTCACCATCAGGGACTACGGTGTGTCCTGG 
TACCAGCAGCGGGCAGGCAGTGCCCCTCGATATCTCCTCTACTACCGCTCGGAGGAGGATCA 
CCACCGGCCTGCTGACATCCCCGATCGATTCTCGGCAGCCAAGGATGAGGCCCACAATGCCT 
GTGTCCTCACCATTAGTCCCGTGCAGCCTGAAGACGACGCGGATTACTACTGCTCTGTTGGC 
TACGGCTTTAGTCCCTAGGGGTGGGGTGTGAGATGGGTGCCTCCCCTCTGCCTCCCATTTCT 
GCCCCTGACCTTGGGTCCCTTTTAAACTTTCTCTGAGCCTTGCTTCCCCTCTGTAAAATGGG 
TTAATAATATTCAACATGTCAACAAC 



WO 99/63088 / _ PCT/US99/12252 

FIGURE 68 

I^CRCLSFLLMGTFLSVSQTVLAQLDALLVFPGQVAQLSCTLSPQHVTIRDYGVSWYQQRAG 
S APRYLLYYRSEEDHHRPAD I PDRFS AAKDEAHNACVLT I S P VQPEDDADYYCS VGYGFS P. 
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GCCGCCCCGCCCCGAGACCGGGCCCGGGGGCGCGGGGCGGCGGGATGCGGCGCCCGGGGCGG 
CGATGACCGCGGAGCGCACGCCGCGGGCCCGGCCCTGACCCCGCCGCCCGCCCGCTGAGCCC 
CCCGCCGAGGTCCGGACAGGCCGAGATGACGCCGAGCCCCCTGTTGCTGCTCCTGCTGCCGC 
CGCTGCTGCTGGGGGCCTTCCCACCGGCCGCCGCCGCCCGAGGCCCCCCAAAGATGGCGGAC 
AAGGTGGTCCCACGGCAGGTGGCCCGGCTGGGCCGCACTGTGCGGCTGCAGTGCCCAGTGGA 
GGGGGACCCGCCGCCGCTGACCATGTGGACCAAGGATGGCCGCACCATCCACAGCGGCTGGA 
GCCGCTTCCGCGTGCTGCCGCAGGGGCTGAAGGTGAAGCAGGTGGAGCGGGAGGATGCCGGC 
GTGTACGTGTGCAAGGCCACCAACGGCTTCGGCAGCCTGAGCGTCAACTACACCCTCGTCGT 
GCTGGATGACATTAGCCCAGGGAAGGAGAGCCTGGGGCCCGACAGCTCCTCTGGGGGTCAAG 
AGGACCCCGCCAGCCAGCAGTGGGCACGACCGCGCTTCACACAGCCCTCCAAGATGAGGCGC 
CGGGTGATCGCACGGCCCGTGGGTAGCTCCGTGCGGCTCAAGTGCGTGGCCAGCGGGCACCC 
TCGGCCCGACATCACGTGGATGAAGGACGACCAGGCCTTGACGCGCCCAGAGGCCGCTGAGC 
CCAGGAAGAAGAAGTGGACACTGAGCCTGAAGAACCTGCGGCCGGAGGACAGCGGCAAATAC 
ACCTGCCGCGTGTCGAACCGCGCGGGCGCCATCAACGCCACCTACAAGGTGGATGTGATCCA 
GCGGACCCGTTCCAAGCCCGTGCTCACAGGCACGCACCCCGTGAACACGACGGTGGACTTCG 
GGGGGACCACGTCCTTCCAGTGCAAGGTGCGCAGCGACGTGAAGCCGGTGATCCAGTGGCTG 
AAGCGCGTGGAGTACGGCGCCGAGGGCCGCCACAACTCCACCATCGATGTGGGCGGCCAGAA 
GTTTGTGGTGCTGCCCACGGGTGACGTGTGGTCGCGGCCCGACGGCTCCTACCTCAATAAGC 
TGCTCATCACCCGTGCCCGCCAGGACGATGCGGGCATGTACATCTGCCTTGGCGCCAACACC 
ATGGGCTACAGCTTCCGCAGCGCCTTCCTCACCGTGCTGCCAGACCCAAAACCGCCAGGGCC 
ACCTGTGGCCTCCTCGTCCTCGGCCACTAGCCTGCCGTGGCCCGTGGTCATCGGCATCCCAG 
CCGGCGCTGTCTTCATCCTGGGCACCCTGCTCCTGTGGCTTTGCCAGGCCCAGAAGAAGCCG 
TGCACCCCCGCGCCTGCCCCTCCCCTGCCTGGGCACCGCCCGCCGGGGACGGCCCGCGACCG 
CAGCGGAGACAAGGACCTTCCCTCGTTGGCCGCCCTCAGCGCTGGCCCTGGTGTGGGGCTGT 
GTGAGGAGCATGGGTCTCCGGCAGCCCCCCAGCACTTACTGGGCCCAGGCCCAGTTGCTGGC 
CCTAAGTTGTACCCCAAACTCTACACAGACATCCACACACACACACACACACACTCTCACAC 
ACACTCACACGTGGAGGGCAAGGTCCACCAGCACATCCACTATCAGTGCTAGACGGCACCGT 
ATCTGCAGTGGGCACGGGGGGGCCGGCCAGACAGGCAGACTGGGAGGATGGAGGACGGAGCT 
GCAGACGAAGGCAGGGGACCCATGGCGAGGAGGAATGGCCAGCACCCCAGGCAGTCTGTGTG 
TGAGGCATAGCCCCTGGACACACACACACAGACACACACACTACCTGGATGCATGTATGCAC 
ACACATGCGCGCACACGTGCTCCCTGAAGGCACACGTACGCACACGCACATGCACAGATATG 
CCGCCTGGGCACACAGATAAGCTGCCCAAATGCACGCACACGCACAGAGACATGCCAGAACA 
TACAAGGACATGCTGCCTGAACATACACACGCACACCCATGCGCAGATGTGCTGCCTGGACA 
CACACACACACACGGATATGCTGTCTGGACGCACACACGTGCAGATATGGTATCCGGACACA 
CACGTGCACAGATATGCTGCCTGGACACACAGATAATGCTGCCTTGACACACACATGCACGG 
ATATTGCCTGGACACACACACACACACACGCGTGCACAGATATGCTGTCTGGACACGCACAC 
ACATGCAGATATGCTGCCTGGACACACACTTCCAGACACACGTGCACAGGCGCAGATATGCT 
GCCTGGACACACGCAGATATGCTGTCTAGTCACACACACACGCAGACATGCTGTCCGGACAC 
ACACACGCATGCACAGATATGCTGTCCGGACACACACACGCACGCAGATATGCTGCCTGGAC 
ACAC AC AC AGATAATGCTGC CTCAACACTCACACACGTGCAGATATTGC CTGGACACACAC A 
TGTGCACAGATATGCTGTCTGGACATGCACACACGTGCAGATATGCTGTCCGGATACACACG 
CACGCACACATGCAGATATGCTGCCTGGGCACACACTTCCGGACACACATGCACACACAGGT 
GCAGATATGCTGCCTGGACACACACACAGATAATGCTGCCTCAACACTCACACACGTGCAGA 
TATTGCCTGGACACACACATGTGCACAGATATGCTGTCTGGACATGCACACACGTGCAGATA 
TGCTGTCCGGATACACACGCACGCACACATGCAGATATGCTGCCTGGGCACACACTTCCGGA 
CACACATGCACACACAGGTGCAGATATGCTGCCTGGACACACGCAGACTGACGTGCTTTTGG 
GAGGGTGTGCCGTGAAGCCTGCAGTACGTGTGCCGTGAGGCTCATAGTTGATGAGGGACTTT 
CCCTGCTCCACCGTCACTCCCCCAACTCTGCCCGCCTCTGTCCCCGCCTCAGTCCCCGCCTC 
CATCCCCGCCTCTGTCCCCTGGCCTTGGCGGCTATTTTTGCCACCTGCCTTGGGTGCCCAGG 
AGTCCCCTACTGCTGTGGGCTGGGGTTGGGGGCACAGCAGCCCCAAGCCTGAGAGGCTGGAG 
CCCATGGCTAGTGGCTCATCCCCAGTGCATTCTCCCCCTGACACAGAGAAGGGGCCTTGGTA 
TTTATATTTAAGAAATGAAGATAATATTAATAATGATGGAAGGAAGACTGGGTTGCAGGGAC 
TGTGGTCTCTCCTGGGGCCCGGGACCCGCCTGGTCTTTCAGCCATGCTGATGACCACACCCC 
GTCCAGGCCAGACACCACCCCCCACCCCACTGTCGTGGTGGCCCCAGATCTCTGTAATTTTA 
TGTAGAGTTTGAGCTGAAGCCCCGTATATTTAATTTATTTTGTTAAACACAAAA 
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FIGURE 70 

MTPSPLLLLLLPPLLLGAFPPAAAARGPPKMADKWPRQVARLGRTVRLQCPVEGDPPPLTM 
WTKDGRTIHSGWSRFRVLPQGLKVKQVEREDAGVYVCKATNGFGSLSVNYTLWLDDISPGK 
ESLGPDSSSGGQEDPASQQWARPRFTQPSKMRRRVIARPVGSSVRLKCVASGHPRPDITWMK 
DDQALTRPEAAEPRKKKWTLSLKNLRPEDSGKYTCRVSNRAGAINATYKVDVIQRTRSKPVL 
TGTHPVNTTVDFGGTTSFQCKVRSDVKPVIQWLKRVEYGAEGRHNSTIDVGGQKFWLPTGD 
VWSRPDGSYLNKLLITRARQDDAGMYICLGANTMGYSFRSAFLTVLPDPKPPGPPVASSSSA 
TSLPWPWIGIPAGAVFILGTLLLWLCQAQKKPCTPAPAPPLPGHRPPGTARDRSGDKDLPS 
LAALSAGPGVGLCEEHGSPAAPQHLLGPGPVAGPKLYPKLYTDIHTHTHTHSHTHSHVEGKV 
HQHIHYQC 
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FIGURE 71A 

CCCAGCTGAGGAGCCCTGCTCAAGACACGGTCACTGGATCTGAGAAACTTCCCAGGGGACCG 
CATTCCAGAGTCAGTGACTCTGTGAAGCACCCACATCTACCTCTTGCCACGTTCCCACGGGC 
TTGGGGGAAAGATGGTGGGGACCAAGGCCTGGGTGTTCTCCTTCCTGGTCCTGGAAGTCACA 
TCTGTGTTGGGGAGACAGACGATGCTCACCCAGTCAGTAAGAAGAGTCCAGCCTGGGAAGAA 
GAACCCCAGCATCTTTGCCAAGCCTGCCGACACCCTGGAGAGCCCTGGTGAGTGGACAACAT 
GGTTCAACATCGACTACCCAGGCGGGAAGGGCGACTATGAGCGGCTGGACGCCATTCGCTTC 
TACTATGGGGACCGTGTATGTGCCCGTCCCCTGCGGCTAGAGGCTCGGACCACTGACTGGAC 
ACCTGCGGGCAGCACTGGCCAGGTGGTCCATGGTAGTCCCCGTGAGGGTTTCTGGTGCCTCA 
ACAGGGAGCAGCGGCCTGGCCAGAACTGCTCTAATTACACCGTACGCTTCCTCTGCCCACCA 
GGATCCCTGCGCCGAGACACAGAGCGCATCTGGAGCCCATGGTCTCCCTGGAGCAAGTGCTC 
AGCTGCCTGTGGTCAGACTGGGGTCCAGACTCGCACACGCATTTGCTTGGCAGAGATGGTGT 
CGCTGTGCAGTGAGGCCAGCGAAGAGGGTCAGCACTGCATGGGCCAGGACTGTACAGCCTGT 
GACCTGACCTGCCCAATGGGCCAGGTGAATGCTGACTGTGATGCCTGCATGTGCCAGGACTT 
CATGCTTCATGGGGCTGTCTCCCTTCCCGGAGGTGCCCCAGCCTCAGGGGCTGCTATCTACC 
TCCTGACCAAGACGCCGAAGCTGCTGACCCAGACAGACAGTGATGGGAGATTCCGAATCCCT 
GGCTTGTGCCCTGATGGCAAAAGCATCCTGAAGATCACAAAGGTCAAGTTTGCCCCCATTGT 
ACTCACAATGCCCAAGACTAGCCTGAAGGCAGCCACCATCAAGGCAGAGTTTGTGAGGGCAG 
AGACTCCATACATGGTGATGAACCCTGAGACAAAAGCACGGAGAGCTGGGCAGAGCGTGTCT 
CTGTGCTGTAAGGCCACAGGGAAGCCCAGGCCAGACAAGTATTTTTGGTATCATAATGACAC 
ATTGCTGGATCCTTCCCTCTACAAGCATGAGAGCAAGCTGGTGCTGAGGAAACTGCAGCAGC 
ACCAGGCTGGGGAGTACTTTTGCAAGGCCCAGAGTGATGCTGGGGCTGTGAAGTCCAAGGTT 
GCCCAGCTGATTGTCACAGCATCTGATGAGACTCCTTGCAACCCAGTTCCTGAGAGCTATCT 
TATCCGGCTGCCCCATGATTGCTTTCAGAATGCCACCAACTCCTTCTACTATGACGTGGGAC 
GCTGCCCTGTTAAGACTTGTGCAGGGCAGCAGGATAATGGGATCAGGTGCCGTGATGCTGTG 
CAGAACTGCTGTGGCATCTCCAAGACAGAGGAAAGGGAGATCCAGTGCAGTGGCTACACGCT 
ACCCACCAAGGTGGCCAAGGAGTGCAGCTGCCAGCGGTGTACGGAAACTCGGAGCATCGTGC 
GGGGCCGTGTCAGTGCTGCTGACAATGGGGAGCCCATGCGCTTTGGCCATGTGTACATGGGG 
AACAGCCGTGTAAGCATGACTGGCTACAAGGGCACTTTCACCCTCCATGTCCCCCAGGACAC 
TGAGAGGCTGGTGCTCACATTTGTGGACAGGCTGCAGAAGTTTGTCAACACCACCAAAGTGC 
TACCTTTCAACAAGAAGGGGAGTGCCGTGTTCCATGAAATCAAGATGCTTCGTCGGAAAGAG 
CCCATCACTTTGGAAGCCATGGAGACCAACATCATCCCCCTGGGGGAAGTGGTTGGTGAAGA 
CCCCATGGCTGAACTGGAGATTCCATCCAGGAGTTTCTACAGGCAGAATGGGGAGCCCTACA 
TAGGAAAAGTGAAGGCCAGTGTGACCTTCCTGGATCCCCGGAATATTTCCACAGCCACAGCT 
GCCCAGACTGACCTGAACTTCATCAATGACGAAGGAGACACTTTCCCCCTTCGGACGTATGG 
CATGTTCTCTGTGGACTTCAGAGATGAGGTCACCTCAGAGCCACTTAATGCTGGCAAAGTGA 
AGGTCCACCTTGACTCGACCCAGGTCAAGATGCCAGAGCACATATCCACAGTGAAACTCTGG 
TCACTCAATCCAGACACAGGGCTGTGGGAGGAGGAAGGTGATTTCAAATTTGAAAATCAAAG 
GAGGAACAAAAGAGAAGACAGAACCTTCCTGGTGGGCAACCTGGAGATTCGTGAGAGGAGGC 
TCTTTAACCTGGATGTTCCTGAAAGCAGGCGGTGCTTTGTTAAGGTGAGGGCCTACCGGAGT 
GAGAGGTTCTTGCCTAGTGAGCAGATCCAGGGGGTTGTGATCTCCGTGATTAACCTGGAGCC 
TAGAACTGGCTTCTTGTCCAACCCTAGGGCCTGGGGCCGCTTTGACAGTGTCATCACAGGCC 
CCAACGGGGCCTGTGTGCCTGCCTTCTGTGATGACCAGTCCCCTGATGCCTACTCTGCCTAT 
GTCTTGGCAAGCCTGGCTGGGGAGGAACTGCAAGCAGTGGAGTCTTCTCCTAAATTCAACCC 
AAATGCAATTGGCGTCCCTCAGCCCTATCTCAACAAGCTCAACTACCGTCGGACGGACCATG 
AGGATCCACGGGTTAAAAAGACAGCTTTCCAGATTAGCATGGCCAAGCCAAGGCCCAACTCA 
GCTGAGGAGAGCAATGGGCCCATCTATGCCTTTGAGAACCTCCGGGCATGTGAAGAGGCACC 
ACCCAGTGCAGCCCACTTCCGGTTCTACCAGATTGAGGGGGATCGATATGACTACAACACAG 
TCCCCTTCAACGAAGATGACCCTATGAGCTGGACTGAAGACTATCTGGCATGGTGGCCAAAG 
CCGATGGAATTCAGGGCCTGCTATATCAAGGTGAAGATTGTGGGGCCACTGGAAGTGAATGT 
GCGATCCCGCAACATGGGGGGCACTCATCGGCGGACAGTGGGGAAGCTGTATGGAATCCGAG 
ATGTGAGGAGCACTCGGGACAGGGACCAGCCCAATGTCTCAGCTGCCTGTCTGGAGTTCAAG 
TGCAGTGGGATGCTCTATGATCAGGACCGTGTGGACCGCACCCTGGTGAAGGTCATCCCCCA 
GGGCAGCTGCCGTCGAGCCAGTGTGAACCCCATGCTGCATGAGTACCTGGTCAACCACTTGC 
CACTTGCAGTCAACAACGACACCAGTGAGTACACCATGCTGGCACCCTTGGACCCACTGGGC 
CACAACTATGGCATCTACACTGTCACTGACCAGGACCCTCGCACGGCCAAGGAGATCGCGCT 
CGGCCGGTGCTTTGATGGCACATCCGATGGCTCCTCCAGAATCATGAAGAGCAATGTGGGAG 
TAGCCCTCACCTTCAACTGTGTAGAGAGGCAAGTAGGCCGCCAGAGTGCCTTCCAGTACCTC 
CAAAGCACCCCAGCCCAGTCCCCTGCTGCAGGCACTGTCCAAGGAAGAGTGCCCTCGAGGAG 
GCAGCAGCGAGCGAG CAGGGGTGG C CAGCG C CAGGGTGG AGTGGTGG CCTCT CTG AGATTT C 
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CTAGAGTTGCTCAACAGCCCCTGATCAACTAAGTTTTGTGGTACTTCACCCTCTTCTGCCCT 
CATTTCATGTGACAGCCATTGTGAGACTGATGCACAAACTGTCACTTGGTTAATTTAAGCAC 
TTCTGTTTTCGTGAATTTGCTTGTTTGTTTCTTCATGCCTTTACTTACTTTGTCCCATGCTA 
CTGATTGGCACGTGGCCCCCACAATGGCACAATAAAGCCCCTTTGTGAAACTGTTCTTTAAA 
TGAAACACAAGAAATTGGCCACTGGTAAAACTCTGCAGCTTCAACTGTACTTCATTTAATGC 
CATTAATGCAAATATACTTCCTCTTCTTTTTGCATGGTTTTGCCCACCTCTGCAATAGTGAT 
AATCTGATGCTGAAGATCAAATAACCAATATAAAGCATATTTCTTGGCCTTGCTCCACAGGA 
CATAGGCAAGCCTTGATCATAGTTCATACATATAAATGGTGGTGAAATAAAGAAATAAAACA 
CAATACTTTTACTTGAAATGTAAATAACTTATTTATTTCTTTGCTAAATTTGGAATTCTAGT 
GCACATTCAAAGTTAAGCTATTAAATATAGGGTGATCATAGTTCCTCTACCAAGTCTGGAAA 
GAACATCTCCTGGTATCCACAATTACACCAGGTTGCTAACTGTATTTGTACATTTCCCTTTG 
CATTCGCTTTTGTTCTTGCTAGAAACCCAGTGTAGCCCAGGGCAGATGTCAATAAATGCATA 
CTCTGTATTTCGAAAAAA 
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FTGTJRE 72 

MVGTKAWVFSFLVLEVTSVLGRQTMLTQSVRRVQPGKKNPS I FAKP ADTLES PGEWTTWFN I 
DYPGGKGDYERLDAIRFYYGDRVCARPLRLEARTTDWTPAGSTGQWHGSPREGFWCLNREQ 
RPGQNCSNYTVRFLCPPGSLRRDTERIWSPWSPWSKCSAACGQTGVQTRTRICLAEMVSLCS 
EASEEGQHCMGQDCTACDLTCPMGQVNADCDACMCQDFMLHGAVSLPGGAPASGAAIYLLTK 
TPKLLTQTDSDGRFRIPGLCPDGKSILKITKVKFAPIVLTMPKTSLKAATIKAEFVRAETPY 
MVMNPETKARRAGQSVSLCCKATGKPRPDKYFWYHNDTLLDPSLYKHESKLVLRKLQQHQAG 
EYFCKAQSDAGAVKSKVAQLIVTASDETPCNPVPESYLIRLPHDCFQNATNSFYYDVGRCPV 
KTCAGQQDNGIRCRDAVQNCCGISKTEEREIQCSGYTLPTKVAKECSCQRCTETRSIVRGRV 
SAADNGEPMRFGHVYMGNSRVSMTGYKGTFTLHVPQDTERLVLTFVDRLQKFVNTTKVLPFN 
KKGSAVFHEIKMLRRKEPITLEAMETNIIPLGEVVGEDPMAELEIPSRSFYRQNGEPYIGKV 
KASVTFLDPRNISTATAAQTDLNFINDEGDTFPLRTYGMFSVDFRDEVTSEPLNAGKVKVHL 
DSTQVKMPEHISTVKLWSLNPDTGLWEEEGDFKFENQRRNKREDRTFLVGNLEIRERRLFNL 
DVPESRRCFVKVRAYRSERFLPSEQIQGWISVINLEPRTGFLSNPRAWGRFDSVITGPNGA 
CVPAFCDDQSPDAYSAYVLASLAGEELQAVESSPKFNPNAIGVPQPYLNKLNYRRTDHEDPR 
VKKTAFQISMAKPRPNSAEESNGPIYAFENLRACEEAPPSAAHFRFYQIEGDRYDYNTVPFN 
EDDPMSWTEDYLAWWPKPMEFRACYIKVKIVGPLEVWRSRl^GGTHRRTVGKLYGIRDVRS 
TRDRDQPNVSAACLEFKCSGMLYDQDRVDRTLVKVIPQGSCRRASVNPMLHEYLVNHLPLAV 
NNDTSEYTMLAPLDPLGHNYGIYTVTDQDPRTAKEIALGRCFDGTSDGSSRIMKSNVGVALT 
FNCVERQVGRQSAFQYLQSTPAQSPAAGTVQGRVPSRRQQRASRGGQRQGGWASLRFPRVA 
QQPLIN 
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CTGCAAGTTGTTAACGCCTAACACACAAGTATGTTAGGCTTCCACCAAAGTCCTCAATATAC 
CTGAATACGCACAATATCTTAACTCTTCATATTTGGTTTTGGGATCTGCTTTGAGGTCCCAT 
CTTCATTTAAAAAAAAATACAGAGACCTACCTACCCGTACGCATACATACATATGTGTATAT 
ATATGTAAACTAGACAAAGATCGCAGATCATAAAGCAAGCTCTGCTTTAGTTTCCAAGAAGA 
TTACAAAGAATTTAGAGATGTATTTGTCAAGATCCCTGTCGATTCATGCCCTTTGGGTTACG 
GTGTCCTCAGTGATGCAGCCCTACCCTTTGGTTTGGGGACATTATGATTTGTGTAAGACTCA 
GATTTACACGGAAGAAGGGAAAGTTTGGGATTACATGGCCTGCCAGCCGGAATCCACGGACA 
TGACAAAATATCTGAAAGTGAAACTCGATCCTCCGGATATTACCTGTGGAGACCCTCCTGAG 
ACGTTCTGTGCAATGGGCAATCCCTACATGTGCAATAATGAGTGTGATGCGAGTACCCCTGA 
GCTGGCACACCCCCCTGAGCTGATGTTTGATTTTGAAGGAAGACATCCCTCCACATTTTGGC 
AGTCTGCCACTTGGAAGGAGTATCCCAAGCCTCTCCAGGTTAACATCACTCTGTCTTGGAGC 
AAAACCATTGAGCTAACAGACAACATAGTTATTACCTTTGAATCTGGGCGTCCAGACCAAAT 
GATCCTGGAGAAGTCTCTCGATTATGGACGAACATGGCAGCCCTATCAGTATTATGCCACAG 
ACTGCTTAGATGCTTTTCACATGGATCCTAAATCCGTGAAGGATTTATCACAGCATACGGTC 
TTAGAAATCATTTGCACAGAAGAGTACTCAACAGGGTATACAACAAATAGCAAAATAATCCA 
CTTTGAAATCAAAGACAGGTTCGCGCTTTTTGCTGGACCTCGCCTACGCAATATGGCTTCCC 
TCTACGGACAGCTGGATACAACCAAGAAACTCAGAGATTTCTTTACAGTCACAGACCTGAGG 
ATAAGGCTGTTAAGACCAGCCGTTGGGGAAATATTTGTAGATGAGCTACACTTGGCACGCTA 
CTTTTACGCGATCTCAGACATAAAGGTGCGAGGAAGGTGCAAGTGTAATCTCCATGCCACTG 
TATGTGTGTATGACAACAGCAAATTGACATGCGAATGTGAGCACAACACTACAGGTCCAGAC 
TGTGGGAAATGCAAGAAGAATTATCAGGGCCGACCTTGGAGTCCAGGCTCCTATCTCCCCAT 
CCCCAAAGGCACTGCAAATACCTGTATCCCCAGTATTTCCAGTATTGGTACGAATGTCTGCG 
ACAACGAGCTCCTGCACTGCCAGAACGGAGGGACGTGCCACAACAACGTGCGCTGCCTGTGC 
CCGGCCGCATACACGGGCATCCTCTGCGAGAAGCTGCGGTGCGAGGAGGCTGGCAGCTGCGG 
CTCCGACTCTGGCCAGGGCGCGCCCCCGCACGGCACCCCAGCGCTGCTGCTGCTGACCACGC 
TGCTGGGAACCGCCAGCCCCCTGGTGTTCTAGGTGTCACCTCCAGCCACACCGGACGGGCCT 
GTGCCGTGGGGAAGCAGACACAACCCAAACATTTGCTACTAACATAGGAAACACACACATAC 
AGACACCCCCACTCAGACAGTGTACAAACTAAGAAGGCCTAACTGAACTAAGCCATATTTAT 
CACCCGTGGACAGCACATCCGAGTCAAGACTGTTAATTTCTGACTCCAGAGGAGTTGGCAGC 
TGTTGATATTATCACTGCAAATCACATTGCCAGCTGCAGAGCATATTGTGGATTGGAAAGGC 
TGCGACAGCCCCCCAAACAGGAAAGACAAAAAACAAACAAATCAACCGACCTAAAAACATTG 
GCTACTCTAGCGTGGTGGGCCCTAGTACGACTCCGCCCAGTGTGTGGACCAACCAAATAGCA 
TTCTTTGCTGTCAGGTGCATTGTGGGCATAAGGAAATCTGTTACAAGCTGCCATATTGGCCT 
GCTTCCGTCCCTGAATCCCTTCCAACCTGTGCTTTAGTGAACGTTGCTCTGTAACCCTCGTT 
GGTTGAAAGATTTCTTTGTCTGATGTTAGTGATGCACATGTGTAACAGCCCCCTCTAAAAGC 
GCAAGCCAGTCATACCCCTGTATATCTTAGCAGCACTGAGTCCAGTGCGAGCACACACCCAC 
TATACAAGAGTGGCTATAGGAAAAAAGAAAGTGTATCTATCCTTTTGTATTCAAATGAAGTT 
ATTTTTCTTGAACTACTGTAATATGTAGATTTTTTGTATTATTGCCAATTTGTGTTACCAGA 
CAATCTGTTAATGTATCTAATTCGAATCAGCAAAGACTGACATTTTATTTTGTCCTCTTTCG 
TTCTGTTTTGTTTCACTGTGCAGAGATTTCTCTGTAAGGGCAACGAACGTGCTGGCATCAAA 
GAATATCAGTTTACATATATAACAAGTGTAATAAGATTCCACCAAAGGACATTCTAAATGTT 
TTCTTGTTGCTTTAACACTGGAAGATTTAAAGAATAAAAACTCCTGCATAAACGATTTCAGG 
AATTTGTATTGCAATTTCTTAAGATGAAAGGAACAGCCACCAAGCAGTTTCACACTCACTTT 
ACTGATTTCTGTGTGGACTGAGTACATTCAGCTGACGAATTTAGTTCCCAGGAAGATGGATT 
GATGTTCACTAGCTTGGACAACTTCTGCAAAATATGAGACTATTTCCACTTGGGAAAAATTA 
CAACAGCAAAAAAAAAAAAAAAAAAAAAA 
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MYLSRSLSIHALWTVSSVMQPYPLWGHYDLC^ 

VKLDPPDITCGDPPETFCAMGNPYMCNNECDASTPELAHPPELMFDFEGRHPSTFWQSATWK 
EYPKPLQVNITLSWSKTIELTDNIVITFESGRPDQMILEKSLDYGRTWQPYQYYATDCLDAF 
HMDPKSVKDLSQHTVLEIICTEEYSTGYTTNSKI IHFEIKDRFALFAGPRLRNMASLYGQLD 
TTKKLRDFFTVTDLRIRLLRPAVGEI FVDELHLARYFYAI S D I KVRGRC KCNLHATVCVYDN 
SKLTCECEHNTTGPDCGKCKKNYQGRPWSPGSYLPIPKGTANTCIPSISSIGTNVCDNELLH 
CQNGGTCHNNVRCLCPAAYTGILCEKLRCEEAGSCGSDSGQGAPPHGTPALLLLTTLLGTAS 
PLVF 
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FIGURE 75 

CCCACGCGTCCGGGTGACCTGGGCCGAGCCCTCCCGGTCGGCTAAGATTGCTGAGGAGGCGG 
CGGGTAGCTGGCAGGCGCCGACTTCCGAAGGCCGCCGTCCGGGCGAGGTGTCCTCATGACTT 
CTCTTGTGGACCATGTCCGTGATCTTTTTTGCCTGCGTGGTACGGGTAAGGGATGGACTGCC 
CCTCTCAGCCTCTACTGATTTTTACCACACCCAAGATTTTTTGGAATGGAGGAGACGGCTCA 
AGAGTTTAGCCTTGCGACTGGCCCAGTATCCAGGTCGAGGTTCTGCAGAAGGTTGTGACTTT 
AGTATACATTTTTCTTCTTTCGGGGACGTGGCCTGCATGGCTATCTGCTCCTGCCAGTGTCC 
AGCAGCCATGGCCTTCTGCTTCCTGGAGACCCTGTGGTGGGAATTCACAGCTTCCTATGACA 
CTACCTGCATTGGCCTAGCCTCCAGGCCATACGCTTTTCTTGAGTTTGACAGCATCATTCAG 
AAAGTGAAGTGGCATTTTAACTATGTAAGTTCCTCTCAGATGGAGTGCAGCTTGGAAAAAAT 
TCAGGAGGAGCTCAAGTTGCAGCCTCCAGCGGTTCTCACTCTGGAGGACACAGATGTGGCAA 
ATGGGGTGATGAATGGTCACACACCGATGCACTTGGAGCCTGCTCCTAATTTCCGAATGGAA 
CCAGTGACAGCCCTGGGTATCCTCTCCCTCATTCTCAACATCATGTGTGCTGCCCTGAATCT 
CATTCGAGGAGTTCACCTTGCAGAACATTCTTTACAGGATCCAAGGAGCTGGTTCTGCTGGT 
TGGACCAAACCTCGTGAGCCAGCCACCCCTGACCCAAATGAGGAGAGCTCTGATTCTCCCAT 
CCGGGAGCAGTGATGTCAAACTTCTGCTGCTGGGGAAATCTCATCAGCAGGGAGCCTGTGGA 
AAAGGGCATGTCAGTGAAATCTGGGAATGGCTGGATTCGGAAACATCTGCCCATGTGTATTG 
ATGGCAGAGCTGTTGCCCACAAGCGCCTTTTATTTAGGGTAAAATTAACAAATCCATTCTAT 
TCCTCTGACCCATGCTTAGTACATATGACCTTTAACCCTTACATTTATATGATTCTGGGGTT 
GCTTCAGAAGTGTTATTTCATGAATCATTCATATGATTTGATCCCCCAGGATTCTATTTTGT 
TTAATGGGCTTTTCTACTAAAAGCATAAAATACTGAGGCTGATTTAGTCAGGGCAAAACCAT 
TTACTTTACATATTCGTTTTCAATACTTGCTGTTCATGTTACACAAGCTTCTTACGGTTTTC 
TTGTAACAATAAATATTTTGAGTAAATAATGGGTACATTTTAACAAACTCAGTAGTACAACC 
TAAACTTGTATAAAAGTGTGTAAAAATGTATAGCCATTTATATCCTATGTATAAATTAAATG 
AGGTGGCTTCAGAAATGGCAGAATAAATCTAAAGTGTTTATTAAAAAAAAAAAAAAAAAAAA 
AAAAG 
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MSVIFFACWRVRDGLPLSASTDFYHTQDFLEWRRRLKSLALRLAQYPGRGSAEGCDFSIHF 
SSFGDVACMAICSCQCPAAMAFCFLETLWWEFTASYDTTCIGIiASRPYAFLEFDSIIQKVKW 
HFNYVSSSQMECSLEKIQEELKLQPPAVLTLEDTDVANGVl^GHTPMHLEPAPNFRMEPVTA 
LGILSLILNIMCAALNLIRGVHLAEHSLQDPRSWFCWLDQTS 
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FIGURE 77 

TGCTTCCTGGAGACCCTGTGGTGGGAATTCACAGCTTCNTATGACACTACCTGCATTGGCNT 
AGCCTCCAGGCCATACGCTTTTCTTGAGTTTGACAGCATCATTCAGAAAGTGAAGTGGCATT 
TTAACTATGTAAGTTCCTNTCAGATGGAGTGCAGCTTGGAAAAAATTCAGGAGGAGCTCAAG 
TTGCAGCCTCCAGCGGTTCTCANTATGGAGGACACAGATGTGGCAAATGGGGT 
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CTCAGCGGCGCTTCCTCGTAGCGAGCCTAGTGGCGGGTGTTTGCATTGAAACGTGAGCGCGA 
CCCGACCTTAAAGAGTGGGGAGCAAAGGGAGGACAGAGCCCTTTAAAACGAGGCGGGTGGTG 
CCTGCCCCTTTAAGGGCGGGGCGTCCGGACGACTGTATCTGAGCCCCAGACTGCCCCGAGTT 
TCTGTCGCAGGCTGCGAGGAAAGGCCCCTAGGCTGGGTCTGGGTGCTTGGCGGCGGCGGCTT 
CCTCCCCGCTCGTCCTCCCCGGGCCCAGAGGCACCTCGGCTTCAGTCATGCTGAGCAGAGTA 
TGGAAGCACCTGACTACGAAGTGCTATCCGTGCGAGAACAGCTATTCCACGAGAGGATCCGC 
GAGTGTATTATATCAACACTTCTGTTTGCAACACTGTACATCCTCTGCCACATCTTCCTGAC 
CCGCTTCAAGAAGCCTGCTGAGTTCACCACAGTGGATGATGAAGATGCCACCGTCAACAAGA 
TTGCGCTCGAGCTGTGCACCTTTACCCTGGCAATTGCCCTGGGTGCTGTCCTGCTCCTGCCC 
TTCTCCATCATCAGCAATGAGGTGCTGCTCTCCCTGCCTCGGAACTACTACATCCAGTGGCT 
CAACGGCTCCCTCATCCATGGCCTCTGGAACCTTGTTTTTCTCTTCCCCAACCTGTCCCTCA 
TCTTCCTCATGCCCTTTGCATATTTCTTCACTGAGTCTGAGGGCTTTGCTGGCTCCAGAAAG 
GGTGTCCTGGGCCGGGTCTATGAGACAGTGGTGATGTTGATGCTCCTCACTCTGCTGGTGCT 
AGGTATGGTGTGGGTGGCATCAGCCATTGTGGACAAGAACAAGGCCAACAGAGAGTCACTCT 
ATGACTTTTGGGAGTACTATCTCCCCTACCTCTACTCATGCATCTCCTTCCTTGGGGTTCTG 
CTGCTCCTGGTGTGTACTCCACTGGGTCTCGCCCGCATGTTCTCCGTCACTGGGAAGCTGCT 
AGTCAAGCCCCGGCTGCTGGAAGACCTGGAGGAGCAGCTGTACTGCTCAGCCTTTGAGGAGG 
CAGCCCTGACCCGCAGGATCTGTAATCCTACTTCCTGCTGGCTGCCTTTAGACATGGAGCTG 
CTACACAGACAGGTCCTGGCTCTGCAGACACAGAGGGTCCTGCTGGAGAAGAGGCGGAAGGC 
TTCAGCCTGGCAACGGAACCTGGGCTACCCCCTGGCTATGCTGTGCTTGCTGGTGCTGACGG 
GCCTGTCTGTGCTCATTGTGGCCATCCACATCCTGGAGCTGCTCATCGATGAGGCTGCCATG 
CCCCGAGGCATGCAGGGTACCTCCTTAGGCCAGGTCTCCTTCTCCAAGCTGGGCTCCTTTGG 
TGCCGTCATTCAGGTTGTACTCATCTTTTACCTAATGGTGTCCTCAGTTGTGGGCTTCTATA 
GCTCTCCACTCTTCCGGAGCCTGCGGCCCAGATGGCACGACACTGCCATGACGCAGATAATT 
GGGAACTGTGTCTGTCTCCTGGTCCTAAGCTCAGCACTTCCTGTCTTCTCTCGAACCCTGGG 
GCTCACTCGCTTTGACCTGCTGGGTGACTTTGGACGCTTCAACTGGCTGGGCAATTTCTACA 
TTGTGTTCCTCTACAACGCAGCCTTTGCAGGCCTCACCACACTCTGTCTGGTGAAGACCTTC 
ACTGCAGCTGTGCGGGCAGAGCTGATCCGGGCCTTTGGGCTGGACAGACTGCCGCTGCCCGT 
CTCCGGTTTCCCCCAGGCATCTAGGAAGACCCAGCACCAGTGACCTCGAGCTGGGGGTGGGA 
AGGAAAAAACTGG ACACTG C CATCTG CTGC CT AGG C CTGGAGGGAAGC CCAAGG CT ACTTGG 
ACCTCAGGACCTGGAATCTGAGAGGGTGGGTGGCAGAGGGGAGCAGAGCCATCTGCACTATT 
GCATAATCTGAGCCAGAGTTTGGGACCAGGACCTCCTGCTTTTCCATACTTAACTGTGGCCT 
CAGCATGGGGTAGGGCTGGGTGACTGGGTCTAGCCCCTGATCCCAAATCTGTTTACACATCA 
ATCTGCCTCACTGCTGTTCTGGGCCATCCCCATAGCCATGTTTACATGATTTGATGTGCAAT 
AGGGTGGGGTAGGGGCAGGGAAAGGACTGGGCCAGGGCAGGCTCGGGAGATAGATTGTCTCC 
CTTGCCTCTGGCCCAGCAGAGCCTAAGCACTGTGCTATCCTGGAGGGGCTTTGGACCACCTG 
AAAGACCAAGGGGATAGGGAGGAGGAGGCTTC AG CCAT C AGC AATAAAGT TGAT CC CAGGG A 
AAAAAA 
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FIGURE 79 

MEAPDYEVLSVREQLFHERIRECIISTLLFATLYILCHIFLTRFKKPAEFTTVDDEDATVNK 
IALELCTFTLAIALGAVLLLPFSIISNEVLLSLPRNYYIQWLNGSLIHGLWNLVFLFPNLSL 
IFLMPFAYFFTESEGFAGSRKGVLGRWETVVMLMLLTLLVLGMVWVASAIVDKNKANRESL 
YDFWEYYLPYLYSCISFLGVLLLLVCTPLGLARMFSVTGKLLVKPRLLEDLEEQLYCSAFEE 
AALTRRICNPTSCWLPLDMELLHRQVLALQTQRVLLEKRRKASAWQRNLGYPLAMLCLLVLT 
GLSVLIVAIHILELLIDEAAMPRGMQGTSLGQVSFSKLGSFGAVIQVVLIFYLMVSSWGFY 
SSPLFRSLRPRWHDTAMTQIIGNCVCLLVLSSALPVFSRTLGLTRFDLLGDFGRFNWLGNFY 
IVFLYNAAFAGLTTLCLVKTFTAAVRAELIRAFGLDRLPLPVSGFPQASRKTQHQ 
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FIGURE 80 

GGCTGCCGAGGGAAGGCCCCTTGGGTTGGTCTTGGTTGCTTGGCGGCGGCGGNTTCNTCCCC 
GCTCGTCCTCCCCGGGCCCAGAGGCACCTCGGCTTCAGTCATGCTGAGCAGAGTATGGAAGC 
ACCTGACTACGAAGTGCTATCCGTGCGAGAACAGCTATTCCACGAGAGGATCCGCGAGTGTA 
TTATATCAACACTTCTGTTTGCAACACTGTACATCCTCTGCCACATCTTCCTGACCCGCTTC 
AAGAAGCCTGCTGAGTTCACCACAGTGGATGATGAAGATGCCACCG 
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FIGURE 81 

GACCGACCTTAAAGAGTGGGAGCAAAGGGAGGACAGAGCCTTTTAAAACGAGGCGGTGGTGC 
CTGCCCTTTAAGGGCGGGGCGTCCGGACGACTGTATCTGAGCCCCAGACTGCCCCGAGTTTC 
TGTCGCAGGCTGCGAGGAAAGGCCCCTAGGCTGGGTCTGGTGCTTGGCGGCGGCGGCTTCCT 
CCCCGTTGTCNTCCCCGGGCCCAGAGGCACCTCGGCTTCAGTCATGCTGAGCAGAGTATGGA 
AGCACCTGACTACGAAGTGCTATCCGTGCGAGAACAGCTATTCCACGAGAGGATCCGCGAGT 
GTATTATATCAACACTTCTGTTTGCAACACTGTACATCNTCTGCCACATCTTCCTGACCCGC 
TTCAAGAAGCCTGCTGAGTTCACCACAGTGGATGATGAAGATGCCACCGTCAACAAGATTGC 
GCTCGAGCTGTGCACCTTTACCCTGGCAATTGCCCTGGGTGCTGTCCTGCTCCTGCCCTTCT 
CCATCATCAGCAATGAGGTGCTGCACTCCC 
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FIGURE 82 

GATGTGCTCCTTGGAGCTGGTGTGCAGTGTCCTGACTGTAAGATCAAGTCCAAACCTGTTTT 
GGAATTGAGGAAACTTCTCTTTTGATCTCAGCCCTTGGTGGTCCAGGTCTTCATGCTGCTGT 
GGGTGATATTACTGGTCCTGGCTCCTGTCAGTGGACAGTTTGCAAGGACACCCAGGCCCATT 
ATTTTCCTCCAGCCTCCATGGACCACAGTCTTCCAAGGAGAGAGAGTGACCCTCACTTGCAA 
GGGATTTCGCTTCTACTCACCACAGAAAACAAAATGGTACCATCGGTACCTTGGGAAAGAAA 
TACTAAGAGAAACCCCAGACAATATCCTTGAGGTTCAGGAATCTGGAGAGTACAGATGCCAG 
GCCCAGGGCTCCCCTCTCAGTAGCCCTGTGCACTTGGATTTTTCTTCAGAGATGGGATTTCC 
TCATGCTGCCCAGGCTAATGTTGAACTCCTGGGCTCAAGTGATCTGCTCACCTAGGCCTCTC 
AAAGCGCTGGGATTACAGCTTCGCTGATCCTGCAAGCTCCACTTTCTGTGTTTGAAGGAGAC 
TCTGTGGTTCTGAGGTGCCGGGCAAAGGCGGAAGTAACACTGAATAATACTATTTACAAGAA 
TGATAATGTCCTGGCATTCCTTAATAAAAGAACTGACTTCCAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 83 

MLLWVILLVLAPVSGQFARTPRPIIFLQPPWTTVFQGERVTLTCKGFRFYSPQKTKWYHRYL 
GKEILRETPDNILEVQESGEYRCQAQGSPLSSPVHLDFSSEMGFPHAAQANVELLGSSDLLT 
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FIGURE 84 

CAGAAGAGGGGGCTAGCTAGCTGTCTCTGCGGACCAGGGAGACCCCCGCGCCCCCCCGGTGT 
GAGGCGGCCTCACAGGGCCGGGTGGGCTGGCGAGCCGACGCGGCGGCGGAGGAGGCTGTGAG 
GAGTGTGTGGAACAGGACCCGGGACAGAGGAACCATGGCTCCGCAGAACCTGAGCACCTTTT 
GCCTGTTGCTGCTATACCTCATCGGGGCGGTGATTGCCGGACGAGATTTCTATAAGATCTTG 
GGGGTGCCTCGAAGTGCCTCTATAAAGGATATTAAAAAGGCCTATAGGAAACTAGCCCTGCA 
GCTTCATCCCGACCGGAACCCTGATGATCCACAAGCCCAGGAGAAATTCCAGGATCTGGGTG 
CTGCTTATGAGGTTCTGTCAGATAGTGAGAAACGGAAACAGTACGATACTTATGGTGAAGAA 
GGATTAAAAGATGGTCATCAGAGCTCCCATGGAGACATTTTTTCACACTTCTTTGGGGATTT 
TGGTTTCATGTTTGGAGGAACCCCTCGTCAGCAAGACAGAAATATTCCAAGAGGAAGTGATA 
TTATTGTAGATCTAGAAGTCACTTTGGAAGAAGTATATGCAGGAAATTTTGTGGAAGTAGTT 
AGAAACAAACCTGTGGCAAGGCAGGCTCCTGGCAAACGGAAGTGCAATTGTCGGCAAGAGAT 
GCGGACCACCCAGCTGGGCCCTGGGCGCTTCCAAATGACCCAGGAGGTGGTCTGCGACGAAT 
GCCCTAATGTCAAACTAGTGAATGAAGAACGAACGCTGGAAGTAGAAATAGAGCCTGGGGTG 
AGAGACGGCATGGAGTACCCCTTTATTGGAGAAGGTGAGCCTCACGTGGATGGGGAGCCTGG 
AGATTTACGGTTCCGAATCAAAGTTGTCAAGCACCCAATATTTGAAAGGAGAGGAGATGATT 
TGTACACAAATGTGACAATCTCATTAGTTGAGTCACTGGTTGGCTTTGAGATGGATATTACT 
CACTTGGATGGTCACAAGGTACATATTTCCCGGGATAAGATCACCAGGCCAGGAGCGAAGCT 
ATGGAAGAAAGGGGAAGGGCTCCCCAACTTTGACAACAACAATATCAAGGGCTCTTTGATAA 
TCACTTTTGATGTGGATTTTCCAAAAGAACAGTTAACAGAGGAAGCGAGAGAAGGTATCAAA 
CAGCTACTGAAACAAGGGTCAGTGCAGAAGGTATACAATGGACTGCAAGGATAT TGA GAGTG 
AATAAAATTGGACTTTGTTTAAAATAAGTGAATAAGCGATATTTATTATCTGCAAGGTTTTT 
TTGTGTGTGTTTTTGTTTTTATTTTCAATATGCAAGTTAGGCTTAATTTTTTTATCTAATGA 
TCATCATGAAATGAATAAGAGGGCTTAAGAATTTGTCCATTTGCATTCGGAAAAGAATGACC 
AGCAAAAGGTTTACTAATACCTCTCCCTTTGGGGATTTAATGTCTGGTGCTGCCGCCTGAGT 
TTCAAGAATTAAAGCTGCAAGAGGACTCCAGGAGCAAAAGAAACACAATATAGAGGGTTGGA 
GTTGTTAGCAATTTCATTCAAAATGCCAACTGGAGAAGTCTGTTTTTAAATACATTTTGTTG 
TTATTTTTA 
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FIGURE 85 

MAPQNLSTFCLLLLYLIGAVIAGRDFYKILGVPRSASIKDIKKAYRKLALQLHPDRNPDDPQ 
AQEKFQDLGAAYEVLSDSEKRKQYDTYGEEGLKDGHQSSHGDIFSHFFGDFGFMFGGTPRQQ 
DRNI PRGSDI I VDLEVTLEEVYAGNFVEWRNKPVARQAPGKRKCNCRQEMRTTQLGPGRFQ 
MTQEWCDECPNVKLVNEERTLEVEIEPGVRDGMEYPFIGEGEPHVDGEPGDLRFRIKWKH 
P I FE RRGDDLYTNVT I SLVE SLVGFEMD I THLDGHKVH I S RDKI TRPGAKLWKKGEGLPNFD 
NNNI KGSL 1 1 TFDVDF PKEQLTEEAREG I KQLLKQGS VQKVYNGLQGY 
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FIGURE 86 

TGGGACCAGGGAACCCCGGGCCCCCCGGTGGAGNGCCTAACAGGCCGGTGGNTGCGACCGAA 
GCGGCGGGCGGAGGAGGTTTTGAGGATTTTTGGAACAGGACCCGGACAGAGGAACCATGGTT 
CCGCAGAACNTGAGCACNTTTTGCCTGTTGNTGNTATACTTCATCGGGGCGGTGATTGCCGG 
ACGAGATTTNTATAAGATTTTGGGGTGCCTNGAAGTGCCTTNTATAAAGGATATTAAAAAGG 
CCTATAGGAAACTAGCCCTGCAGNTTTATCCCGACCGGAACCCTGATGATCCACAAGCCCAG 
GAGAAATTCCAGGATTTGGGTGCTGCTTATGAGGTTNTGTCAGATAGTGAGAAACGGAAACA 
GTACGATAATTATGGTGAAGAAGGATTAAAAGATGGTNATCAGAGCTCCCATGGAGACATTT 
TTTCACACTTNTTTGGGGATTTTGGTTTCATGTTTGGAGGAACCCCTNGTCAGCAAGACAGA 
AATATTCCAAGAG 
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FIGURE 87 

GGCACGAGGCGGCGGGGCAGTCGCGGGATGCGCCCGGGAGCCACAGCCTGAGGCCCTCAGGT 
CTCTGCAGGTGTCGTGGAGGAACCTAGCACCTGCCATCCTCTTCCCCAATTTGCCACTTCCA 
GCAGCTTTAGCCCATGAGGAGGATGTGACCGGGACTGAGTCAGGAGCCCTCTGGAAGCATGG 
AGACTGTGGTGATTGTTGCCATAGGTGTGCTGGCCACCATCTTTCTGGCTTCGTTTGCAGCC 
TTGGTGCTGGTTTGCAGGCAGCGCTACTGCCGGCCGCGAGACCTGCTGCAGCGCTATGATTC 
TAAGCCCATTGTGGACCTCATTGGTGCCATGGAGACCCAGTCTGAGCCCTCTGAGTTAGAAC 
TGGACGATGTCGTTATCACCAACCCCCACATTGAGGCCATTCTGGAGAATGAAGACTGGATC 
GAAGATGCCTCGGGTCTCATGTCCCACTGCATTGCCATCTTGAAGATTTGTCACACTCTGAC 
AGAGAAGCTTGTTGCCATGACAATGGGCTCTGGGGCCAAGATGAAGACTTCAGCCAGTGTCA 
GCGACATCATTGTGGTGGCCAAGCGGATCAGCCCCAGGGTGGATGATGTTGTGAAGTCGATG 
TACCCTCCGTTGGACCCCAAACTCCTGGACGCACGGACGACTGCCCTGCTCCTGTCTGTCAG 
TCACCTGGTGCTGGTGACAAGGAATGCCTGCCATCTGACGGGAGGCCTGGACTGGATTGACC 
AGTCTCTGTCGGCTGCTGAGGAGCATTTGGAAGTCCTTCGAGAAGCAGCCCTAGCTTCTGAG 
CCAGATAAAGGCCTCCCAGGCCCTGAAGGCTTCCTGCAGGAGCAGTCTGCAATTTAGTGCCT 
ACAGGCCAGCAGCTAGCCATGAAGGCCCCTGCCGCCATCCCTGGATGGCTCAGCTTAGCCTT 
CTACTTTTTCCTATAGAGTTAGTTGTTCTCCACGGCTGGAGAGTTCAGCTGTGTGTGCATAG 
TAAAGCAGGAGATCCCCGTCAGTTTATGCCTCTTTTGCAGTTGCAAACTGTGGCTGGTGAGT 
GGCAGTCTAATACTACAGTTAGGGGAGATGCCATTCACTCTCTGCAAGAGGAGTATTGAAAA 
CTGGTGGACTGTCAGCTTTATTTAGCTCACCTAGTGTTTTCAAGAAAATTGAGCCACCGTCT 
AAGAAATCAAGAGGTTTCACATTAAAATTAGAATTTCTGGCCTCTCTCGATCGGTCAGAATG 
TGTGGCAATTCTGATCTGCATTTTCAGAAGAGGACAATCAATTGAAACTAAGTAGGGGTTTC 
TTCTTTTGGCAAGACTTGTACTCTCTCACCTGGCCTGTTTCATTTATTTGTATTATCTGCCT 
GGTCCCTGAGGCGTCTGGGTCTCTCCTCTCCCTTGCAGGTTTGGGTTTGAAGCTGAGGAACT 
ACAAAGTTGATGATTTCTTTTTTATCTTTATGCCTGCAATTTTACCTAGCTACCACTAGGTG 
GATAGTAAATTTATACTTATGTTTCCCTCAAAAAAAAAAAAAAA 
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FIGURE 88 

METWIVAIGVLATIFLASFAALVLVCRQRYCRPRDLLQRYDSKPIVDLIGAMETQSEPSEL 
ELDDWITNPHIEAILENEDWIEDASGLMSHCIAILKICHTLTEKLVAMTMGSGAKMKTSAS 
VSDIIWAKRISPRVDDWKSMYPPLDPKLLDARTTALLLSVSHLVLVTRNACHLTGGLDWI 
DQSLSAAEEHLEVLREAALASEPDKGLPGPEGFLQEQSAI 
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GCTTCATTTCTCCCGACTCAGCTTCCCACCCTGGGCTTTCCGAGGTGCTTTCGCCGCTGTCC 
CCACCACTGCAGCCATGATCTCCTTAACGGACACGCAGAAAATTGGAATGGGATTAACAGGA 
TTTGGAGTGTTTTTCCTGTTCTTTGGAATGATTCTCTTTTTTGACAAAGCACTACTGGCTAT 
TGGAAATGTTTTATTTGTAGCCGGCTTGGCTTTTGTAATTGGTTTAGAAAGAACATTCAGAT 
TCTTCTTCCAAAAACATAAAATGAAAGCTACAGGTTTTTTTCTGGGTGGTGTATTTGTAGTC 
CTTATTGGTTGGCCTTTGATAGGCATGATCTTCGAAATTTATGGATTTTTTCTCTTGTTCAG 
GGGCTTCTTTCCTGTCGTTGTTGGCTTTATTAGAAGAGTGCCAGTCCTTGGATCCCTCCTAAAT 
TTACCTGGAATTAGATCATTTGTAGATAAAGTTGGAGAAAGCAAGAATATGGT ATAA CAACA 
AGTGAATTTGAAGACTCATTTAAAATATTGTGTTATTTATAAAGTCATTTGAAGAATATTCA 
GCACAAAATTAAATTACATGAAATAGCTTGTAATGTTCTTTACAGGAGTTTAAAACGTATAG 
CCTACAAAGTACCAGCAGCAAATTAGCAAAGAAGCAGTGAAAACAGGCTTCTACTCAAGTGA 
ACTAAGAAGAAGTCAGCAAGCAAACTGAGAGAGGTGAAATCCATGTTAATGATGCTTAAGAA 
ACTCTTGAAGGCTATTTGTGTTGTTTTTCCACAATGTGCGAAACTCAGCCATCCTTAGAGAA 
CTGTGGTGCCTGTTTCTTTTCTTTTTATTTTGAAGGCTCAGGAGCATCCATAGGCATTTGCT 
TTTTAGAAGTGTCCACTGCAATGGCAAAAATATTTCCAGTTGCACTGTATCTCTGGAAGTGA 
TGCATGAATTCGATTGGATTGTGTCATTTTAAAGTATTAAAACCAAGGAAACCCCAATTTTG 
ATGTATGGATTACTTTTTTTTGNGCNCAGGGCC 
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FIGURE 90 

MISLTDTQKIGMGLTGFGVFFLFFGMILFFDKALLAIGNVLFVAGLAFVIGLERTFRFFFQK 
HKMKATGFFLGGVFWLIGWPLIGMIFEIYGFFLLFRGFFPWVGFIRRVPVLGSLLNLPGI 
RSFVDKVGESNNMV 

Important features: 
Transmembrane domains: 

amino acids 12-30 (typell) , 33-52, 69-89 and 93-109 

N-myristoylation sites. 

amino acids 11-16, 51-56 and 116-121 

Aminoacyl- transfer RNA synthetases class-II protein. 

amino acids 49-59 



WO 99/63088 ^ ^ v ^ PCT/US99/12252 

FIGURE 91 

GAAGACGTGGCGGCTCTCGCCTGGGCTGTTTCCCGGCTTCATTTCTCCCGACTCAGCTTCCC 
ACCNTGGGCTTTCCGAGGTGCTTTCGCCGCTGTCCCCACCACTGCAGCCATGATCTCCTTAA 
CGGACACGCAGAAAATTGGAATGGGATTAACCGGATTTGGAGTGTTTTTCCTGTTCTTTGGA 
ATGATTCTCTTTTTTGACAAAGCACTACTGGCTATTGGAAATGTTTTATTTGTAGCCGGCTT 
GGCTTTTGTAATTGGTTTAGAAAGAACATTCAGATTCTTCTTCCAAAAACATAAAATGAAAG 
CTACAGGTTTTTTTCTGGGTGGTGTATTTGTAGTCCTTATTGGTTGGCCTTTGATAGGCATG 
ATCTTCGAAATTTATGGATTTTTTCTCTTGTTC 
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FIGURE 92 

GGCACGAGGCTGAACCCAGCCGGCTCCATCTCAGCTTCTGGTTTCTAAGTCCATGTGCCAAA 
GGCTGCCAGGAAGGAGACGCCTTCCTGAGTCCTGGATCTTTCTTCCTTCTGGAAATCTTTGA 
CTGTGGGTAGTTATTTATTTCTGAATAAGAGCGTCCACGCATCATGGACCTCGCGGGACTGC 
TGAAGTCTCAGTTCCTGTGCCACCTGGTCTTCTGCTACGTCTTTATTGCCTCAGGGCTAATC 
ATCAACACCATTCAGCTCTTCACTCTCCTCCTCTGGCCCATTAACAAGCAGCTCTTCCGGAA 
GATCAACTGCAGACTGTCCTATTGCATCTCAAGCCAGCTGGTGATGCTGCTGGAGTGGTGGT 
CGGGCACGGAATGCACCATCTTCACGGACCCGCGCGCCTACCTCAAGTATGGGAAGGAAAAT 
GCCATCGTGGTTCTCAACCACAAGTTTGAAATTGACTTTCTGTGTGGCTGGAGCCTGTCCGA 
ACGCTTTGGGCTGTTAGGGGGCTCCAAGGTCCTGGCCAAGAAAGAGCTGGCCTATGTCCCAA 
TTATCGGCTGGATGTGGTACTTCACCGAGATGGTCTTCTGTTCGCGCAAGTGGGAGCAGGAT 
CGCAAGACGGTTGCCACCAGTTTGCAGCACCTCCGGGACTACCCCGAGAAGTATTTTTTCCT 
GATTCACTGTGAGGGCACACGGTTCACGGAGAAGAAGCATGAGATCAGCATGCAGGTGGCCC 
GGGCCAAGGGGCTGCCTCGCCTCAAGCATCACCTGTTGCCACGAACCAAGGGCTTCGCCATC 
ACCGTGAGGAGCTTGAGAAATGTAGTTTCAGCTGTATATGACTGTACACTCAATTTCAGAAA 
TAATGAAAATCCAACACTGCTGGGAGTCCTAAACGGAAAGAAATACCATGCAGATTTGTATG 
TTAGGAGGATCCCACTGGAAGACATCCCTGAAGACGATGACGAGTGCTCGGCCTGGCTGCAC 
AAGCTCTACCAGGAGAAGGATGCCTTTCAGGAGGAGTACTACAGGACGGGCACCTTCCCAGA 
GACGCCCATGGTGCCCCCCCGGCGGCCCTGGACCCTCGTGAACTGGGTGTTTTGGGCCTCGC 
TGGTGCTCTACCCTTTCTTCCAGTTCCTGGTCAGCATGATCAGGAGCGGGTCTTCCCTGACG 
CTGGCCAGCTTCATCCTCGTCTTCTTTGTGGCCTCCGTGGGAGTTCGATGGATGATTGGTGT 
GACGGAAATTGACAAGGGCTCTGCCTACGGCAACTCTGACAGCAAGCAGAAACTGAATGACT 
GACTCAGGGAGGTGTCACCATCCGAAGGGAACCTTGGGGAACTGGTGGCCTCTGCATATCCT 
CCTTAGTGGGACACGGTGACAAAGGCTGGGTGAGCCCCTGCTGGGCACGGCGGAAGTCACGA 
CCTCTCCAGCCAGGGAGTCTGGTCTCAAGGCCGGATGGGGAGGAAGATGTTTTGTAATCTTT 
TTTTCCCCATGTGCTTTAGTGGGCTTTGGTTTTCTTTTTGTGCGAGTGTGTGTGAGAATGGC 
TGTGTGGTGAGTGTGAACTTTGTTCTGTGATCATAGAAAGGGTATTTTAGGCTGCAGGGGAG 
GGCAGGGCTGGGGACCGAAGGGGACAAGTTCCCCTTTCATCCTTTGGTGCTGAGTTTTCTGT 
AACCCTTGGTTGCCAGAGATAAAGTGAAAAGTGCTTTAGGTGAGATGACTAAATTATGCCTC 
CAAGAAAAAAAAATTAAAGTGCTTTTCTGGGTCAAAAAAAAAAAA 
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FIGURE 93 

MDLAGLLKSQFLCHLVFCYVF I ASGL I INT I QLFTLLLWP INKQLFRKINCRLS YC I SSQLV 
MLLEWWSGTECTIFTDPRAYLKYGKENAIVVLNHKFEIDFLCGWSLSERFGLLGGSKVLAKK 
ELAYVPIIGWMWYFTEMVFCSRKWEQDRKTVATSLQHLRDYPEKYFFLIHCEGTRFTEKKHE 
ISMQVARAKGLPRLKHHLLPRTKGFAITVRSLRNWSAVYDCTLNFRNNENPTLLGVLNGKK 
YHADLYVRRI PLED I PEDDDECSAWLHKLYQEKDAFQEE YYRTGTFPETPMVPPRRPWTLVN 
WLFWASLVLYPFFQFLVSMIRSGSSLTLASFILVFFVASVGVRWMIGVTEIDKGSAYGNSDS 
KQKLND 
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FIGURE 94 

CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCG 
GCGCACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAA 
GTAAAAGGTGAAGCCAAGAACAGCATTACTGATTCCCAAATGGATGATGTTGAAGTTGTTTA 
TACAATTGACATTCAGAAATATATTCCATGCTATCAGCTTTTTAGCTTTTATAATTCTTCAG 
GCGAAGTAAATGAGCAAGCACTGAAGAAAATATTATCAAATGTCAAAAAGAATGTGGTAGGT 
TGGTACAAATTCCGTCGTCATTCAGATCAGATCATGACGTTTAGAGAGAGGCTGCTTCACAA 
AAACTTGCAGGAGCATTTTTCAAACCAAGACCTTGTTTTTCTGCTATTAACACCAAGTATAA 
TAACAGAAAGCTGCTCTACTCATCGACTGGAACATTCCTTATATAAACCTCAAAAAGGACTT 
TTTCACAGGGTACCTTTAGTGGTTGCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAAC 
TGTATCAGGTTCCTGTATGTCCACTGGTTTTAGCCGAGCAGTACAAACACACAGCTCTAAAT 
TTTTTGAAGAAGATGGATCCTTAAAGGAGGTACATAAGATAAATGAAATGTATGCTTCATTA 
CAAGAGGAATTAAAGAGTATATGCAAAAAAGTGGAAGACAGTGAACAAGCAGTAGATAAACT 
AGTAAAGGATGTAAACAGATTAAAACGAGAAATTGAGAAAAGGAGAGGAGCACAGATTCAGG 
CAGCAAGAGAGAAGAACATCCAAAAAGACCCTCAGGAGAACATTTTTCTTTGTCAGGCATTA 
CGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTTATGTCTTTAAAAAATAGACA 
TGTTTCTAAAAGTAGCTGTAACTACAACCACCATCTCGATGTAGTAGACAATCTGACCTTAA 
TGGTAGAACACACTGACATTCCTGAAGCTAGTCCAGCTAGTACACCACAAATCATTAAGCAT 
AAAGCCTTAGACTTAGATGACAGATGGCAATTCAAGAGATCTCGGTTGTTAGATACACAAGA 
CAAACGATCTAAAGCAAATACTGGTAGTAGTAACCAAGATAAAGCATCCAAAATGAGCAGCC 
CAGAAACAGATGAAGAAATTGAAAAGATGAAGGGTTTTGGTGAATATTCACGGTCTCCTACA 
TTTTGATCCTTTTAACCTTACAAGGAGATTTTTTTATTTGGCTGATGGGTAAAGCCAAACAT 
TTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCAC 
CTGTTTGCAGTAATACACAGATAACTCTTAGTGCATTTACTTCACAAAGTACTTTTTCAAAC 
ATCAGATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCT 
TACACAGACACATTCTTTAGAATTGGAAAAGTGAGACCAGGCACAGTGGCTCACACCTGTAA 
TCCCAGCACTTAGGGAAGACAAGTCAGGAGGATTGATTGAAGCTAGGAGTTAGAGACCAGCC 
TGGGCAACGTATTGAGACCATGTCTATTAAAAAATAAAATGGAAAAGCAAGAATAGCCTTAT 
TTTCAAAATATGGAAAGAAATTTATATGAAAATTTATCTGAGTCATTAAAATTCTCCTTAAG 
TGATACTTTTTTAGAAGTACATTATGGCTAGAGTTGCCAGATAAAATGCTGGATATCATGCA 
ATAAATTTGCAAAACATCATCTAAAATTTAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 95 

MEGESTSAVLSGFVLGAIAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEVVYTIDIQ 
KYIPCYQLFSFYNSSGEWEQALKKILSWKKNWGWYKFRRHSDQIMTFRERLLHKNLQEH 
FSNQDLVFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSC 
MSTGFSRAVQTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVN 
RLKREIEKRRGAQIQAAREKNIQKDPQENIFLCQALRTFFPNSEFLHSCVMSLKNRHVSKSS 
CNYNHHLDWDNLTLMVEHTDI PEASPASTPQI IKHKALDLDDRWQFKRSRLLDTQDKRSKA 
NTGSSNQDKASKMSSPETDEEIEKMKGFGEYSRSPTF 
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FIGURE 96 

GGCACAGCCGCGCGGCGGAGGGCAGAGTCAGCCGAGCCGAGTCCAGCCGGACGAGCGGACCA 
GCGCAGGGCAGCCCAAGCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCC 
CCGCGGCGCCTGCCACCCTTCCCTCCTTCCCCGCGTCCCCGCCTCGCCGGCCAGTCAGCTTG 
CCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCTGCTTCCCTGGGCCG 
CGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCT 
GACGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCA 
ACTCCAACTCCTTCTCCCTCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCC 
GCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAAGGTGGGAACGCGTCCGCCCCGGCCCGCA 
CCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTCAGCGCCGCGCTG 
CTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGG 
CTTCAACAAGAACGATGCCCCCCTCCACGAGATCAACGGTGATCATTTGAAGATCTGTCCCC 
AGGGTTCTACCTGCTGCTCTCAAGAGATGGAGGAGAAGTACAGCCTGCAAAGTAAAGATGAT 
TTCAAAAGTGTGGTCAGCGAACAGTG(^ATCATTTGCAAGCTGTCTTTGCTTCACGTTACAA 
GAAGTTTGATGAATTCTTCAAAGAACTACTTGAAAATGCAGAGAAATCCCTGAATGATATGT 
TTGTGAAGACATATGGCCATTTATACATGCAAAATTCTGAGCTATTTAAAGATCTCTTCGTA 
GAGTTGAAACGTTACTACGTGGTGGGAAATGTGAACCTGGAAGAAATGCTAAATGACTTCTG 
GGCTCGCCTCCTGGAGCGGATGTTCCGCCTGGTGAACTCCCAGTACCACTTTACAGATGAGT 
ATCTGGAATGTGTGAGCAAGTATACGGAGCAGCTGAAGCCCTTCGGAGATGTCCCTCGCAAA 
TTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGCTTAGCGGT 
TGCGGGAGATGTCGTGAGCAAGGTCTCCGTGGTAAACCCCACAGCCCAGTGTACCCATGCCC 
TGTTGAAGATGATCTACTGCTCCCACTGCCGGGGTCTCGTGACTGTGAAGCCATGTTACAAC 
TACTGCTCAAACATCATGAGAGGCTGTTTGGCCAACCAAGGGGATCTCGATTTTGAATGGAA 
CAATTTCATAGATGCTATGCTGATGGTGGCAGAGAGGCTAGAGGGTCCTTTCAACATTGAAT 
CGGTCATGGATCCCATCGATGTGAAGATTTCTGATGCTATTATGAACATGCAGGATAATAGT 
GTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCTCCCAGCTGGACG 
AATTTCTCGTTCCATCTCTGAAAGTGCCTTCAGTGCTCGCTTCAGACCACATCACCCCGAGG 
AACGCCCAACCACAGCAGCTGGCACTAGTTTGGACCGACTGGTTACTGATGTCAAGGAGAAA 
CTGAAACAGGCCAAGAAATTCTGGTCCTCCCTTCCGAGCAACGTTTGCAACGATGAGAGGAT 
GGCTGCAGGAAACGGCAATGAGGATGACTGTTGGAATGGGAAAGGCAAAAGCAGGTACCTGT 
TTGCAGTGACAGGAAATGGATTAGCCAACCAGGGCAACAACCCAGAGGTCCAGGTTGACACC 
AGCAAACCAGACATACTGATCCTTCGTCAAATCATGGCTCTTCGAGTGATGACCAGCAAGAT 
GAAGAATGCATACAATGGGAACGACGTGGACTTCTTTGATATCAGTGATGAAAGTAGTGGAG 
AAGGAAGTGGAAGTGGCTGTGAGTATCAGCAGTGCCCTTCAGAGTTTGACTACAATGCCACT 
GACCATGCTGGGAAGAGTGCCAATGAGAAAGCCGACAGTGCTGGTGTCCGTCCTGGGGCACA 
GGCCTACCTCCTCACTGTCTTCTGCATCTTGTTCCTGGTTATGCAGAGAGAGTGGAGATAAT 
TCTCAAACTCTGAGAAAAAGTGTTCATCAAAAAGTTAAAAGGCACCAGTTATCACTTTTCTA 
CCATCCTAGTGACTTTGCTTTTTAAATGAATGGACAACAATGTACAGTTTTTACTATGTGGC 
CACTGGTTTAAGAAGTGCTGACTTTGTTTTCTCATTCAGTTTTGGGAGGAAAAGGGACTGTG 
CATTGAGTTGGTTCCTGCTCCCCCAAACCATGTTAAACGTGGCTAACAGTGTAGGTACAGAA 
CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTC 
ATTTCGTTTGTGGGTTTTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAG 
GGTCCCTTCTTGGCACGTAACATGTACGTATTTCTGAAATATTAAATAGCTGTACAGAAGCA 
GGTTTTATTTATCATGTTATCTTATTAAAAGAAAAAGCCCAAAAAGC ~~ ' 
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FIGURE 97 

MARFGLPALLCTLAVLSAALLAAELKSKSCSEVRRLYVSKGFNKNDAPLHEINGDHLKICPQ 
GSTCCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMF 
VKTYGHLYMQNSELFKDLFVELKRYYWGNVNLEEMLNDFWARLLERMFRLVNSQYHFTDEY 
LECVSKYTEQLKPFGDVPRKLKLQVTRAFVAARTFAQGLAVAGDWSKVSWNPTAQCTHAL 
LKMIYCSHCRGLVTVKPCYNYCSNIMRGCLANQGDLDFEWNNFIDAMLMVAERLEGPFNIES 
VMDPIDVKISDAIMNMQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEE 
RPTTAAGTSLDRLVTDVKEKLKQAKKFWSSLPSNVCNDERMAAGNGNEDDCWNGKGKSRYLF 
AVTGNGLANQGNNPEVQVDTSKPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSG^ 
GSGSGCEYQQCPSEFDYNATDHAGKSANEKADSAGVRPGAQAYLLTVFCILFLVMQREWR 



WO 99/63088 A O "f /b\ O PCT/US99/12252 

FIGURE 98 

CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCT 
GACCTGAGTCATCCCCAGGGATCAGGAGCCTCCAGCAGGGAACCTTCCATTATATTCTTCAA 
GCAACTTAGAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGC 
TGCCACTAATGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCAC 
AGGGACCGAGGCCAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAA 
AGATTGGTTCCTGAGAGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGC 
AGTGCCCCTGTGATCATTTCAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCACCACAGA 
AAGCCAAACAAGCATTCCAGAGCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTT 
TGCTCTGCCTTTGTAGGAGCTCTGAGCGCCCACTCTTCCAATTAAACATTCTCAGCCAAGAA 
GACAGTGAGCACACCTACCAGACACTCTTCTTCTCCCACCTCACTCTCCCACTGTACCCACC 
CCTAAATCATTCCAGTGCTCTCAAAAAGCATGTTTTTCAAGATCATTTTGTTTGTTGCTCTC 
TCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCTGTGCCCTCCCCTTACCCAGGCTTAGGCTT 
AATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCTAGCTAGTGTCATTTAACCTTAAATGC 
AATCAGGAAAGTAGCAAACAGAAGTCAATAAATATTTTTAAATGTCAAAAAAAAAAAAAAAAA^ 
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FIGURE 99 



MKVLISSLLLLLPLMLMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRR 
KFMTVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 
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FIGURE 100 

AATGGCTGTCTTAGTACTTCGCCTGACAGTTGTCCTGGGACTGCTTGTCTTATTCCTGACCT 
GCTATGCAGACGACAAACCAGACAAGCCAGACGACAAGCCAGACGACTCGGGCAAAGACCCA 
AAGCCAGACTTCCCCAAATTCCTAAGCCTCCTGGGCACAGAGATCATTGAGAATGCAGTCGA 
GTTCATCCTCCGCTCCATGTCCAGGAGCACAGGATTTATGGAATTTGATGATAATGAAGGAA 
AACATTCATCAAAGTGACATCCTCAGGACACACCCATGTGGCTCCTGGACAATCCAAGAGCA 
GCCAAATCCTGCTTTTCCAGTTTGGCTCCACAAGTCCTCCAGGACAGAGCCCTCAAAGCAAC 
TCCCAACGAGTTCTCAGGATTCAGGCTCTGGCTTCAACCAAACAGAACTCATTTTGAACACC 
CTGACTGCATTTTTGCTTTTAGAAAGTTAGAATAAATATGGCGCTTTGGGATCACATAGTTG 
ATGGAGAGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 101 

MAVLVLRLTVVLGLLVLFLTCYADDKPDKPDDKPDDSGKDPKPDFPKFLSLLGTEIIENAVE 
FILRSMSRSTGFMEFDDNEGKHSSK 
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FIGURE 102 

GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCT 
CAGAGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTAC 
CCTGCCCCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCC 
CCTACCTGATGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAG 
CTCTTCAGCCAGATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGG 
CTGCGGAACCGGAGCCAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACC 
CAAATCCCCACTTTGAGAAGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATAT 
GAGCGGTTTGTGGTGGCTCCTGGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGT 
GGTGGTCTGCACTCTGGTGCTGTGCTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCC 
GGAGAGTACTGAGACCGGGAGGTGTGCTCTTTTTCTGGGAGCATGTGGCAGAACCATATGGA 
AGCTGGGCCTTCATGTGGCAGCAAGTTTTCGAGCCCACCTGGAAACACATTGGGGATGGCTG 
CTGCCTCACCAGAGAGACCTGGAAGGATCTTGAGAACGCCCAGTTCTCCGAAATCCAAATGG 
AACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTGGGCCCCACATCATGGGAAAGGCTGTC 
AAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCCTTCCCCAGCCTCCAATTAGAACA 
AGCCACCCACCAGCCTATCTATCTTCCACTGAGAGGGACCTAGCAGAATGAGAGAAGACATT 
CATGTACCACCTACTAGTCCCTCTCTCCCCAACCTCTGCCAGGGCAATCTCTAACTTCAATC 
CCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGGAAACACTAGGACCC 
TGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTCCCAATGTTGTC 
CCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACACCCATGCGT 
CTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGACCCTCT 
CTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGGAT 
GCCAGAGCAAGACTCAAAGAGGCAGAGGTTTTGTTCTCAAATATTTTTTAATAAATAGACGA 
AACCACG 
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FIGURE 103 

MDILVPLLQLLVLLLTLPLHLMAL^ 

KGLTGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFW 
APGEDMRQLADGSMDVWCTLVLCSVQSPRKVLQEVRRVLRPGGVLFFWEHVAEPYGSWAFM 
WQQVFEPTWKHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFP 
SSKALICSFPSLQLEQATHQPIYLPLRGT 
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FIGURE 104 

GTGGGATTTATTTGAGTGCAAGATCGTTTTCTCAGTGGTGGTGGAAGTTGCCTCATCGCAGG 

CAGATGTTGGGGCTTTGTCCGAACAGCTCCCCTCTGCCAGCTTCTGTAGATAAGGGTTAAAA 

ACTAATATTTATATGACAGAAGAAAAAGATGTCATTCCGTAAAGTAAACATCATCATCTTGG 

TCCTGGCTGTTGCTCTCTTCTTACTGGTTTTGCACCATAACTTCCTCAGCTTGAGCAGTTTG 

TTAAGGAATGAGGTTACAGATTCAGGAATTGTAGGGCCTCAACCTATAGACTTTGTCCCAAA 

TGCTCTCCGACATGCAGTAGATGGGAGACAAGAGGAGATTCCTGTGGTCATCGCTGCATCTG 

AAGACAGGCTTGGGGGGGCCATTGCAGCTATAAACAGCATTCAGCACAACACTCGCTCCAAT 

GTGATTTTCTACATTGTTACTCTCAACAATACAGCAGACCATCTCCGGTCCTGGCTCAACAG 

TGATTCCCTGAAAAGCATCAGATACAAAATTGTCAATTTTGACCCTAAACTTTTGGAAGGAA 

AAGTAAAGGAGGATCCTGACCAGGGGGAATCCATGAAACCTTTAACCTTTGCAAGGTTCTAC 

TTGCCAATTCTGGTTCCCAGCGCAAAGAAGGCCATATACATGGATGATGATGTAATTGTGCA 

AGGTGATATTCTTGCCCTTTACAATACAGCACTGAAGCCAGGACATGCAGCTGCATTTTCAG 

AAGATTGTGATTCAGCCTCTACTAAAGTTGTCATCCGTGGAGCAGGAAACCAGTACAATTAC 

ATTGGCTATCTTGACTATAAAAAGGAAAGAATTCGTAAGCTTTCCATGAAAGCCAGCACTTG 

CTCATTTAATCCTGGAGTTTTTGTTGCAAACCTGACGGAATGGAAACGACAGAATATAACTA 

ACCAACTGGAAAAATGGATGAAACTCAATGTAGAAGAGGGACTGTATAGCAGAACCCTGGCT 

GGTAGCATCACAACACCTCCTCTGCTTATCGTATTTTATCAACAGCACTCTACCATCGATCC 

TATGTGGAATGTCCGCCACCTTGGTTCCAGTGCTGGAAAACGATATTCACCTCAGTTTGTAA 

AGGCTGCCAAGTTACTCCATTGGAATGGACATTTGAAGCCATGGGGAAGGACTGCTTCATAT 

ACTGATGTTTGGGAAAAATGGTATATTCCAGACCCAACAGGCAAATTCAACCTAATCCGAAG 

ATATACCGAGATCTCAAACATAAAGTGAAACAG7UVTTTGAACTGTAAGCAAGCATTTCTCAG 

GAAGTCCTGGAAGATAGCATGCATGGGAAGTAACAGTTGCTAGGCTTCAATGCCTATCGGTA 

GCAAGCCATGGAAAAAGATGTGTCAGCTAGGTAAAGATGACAAACTGCCCTGTCTGGCAGTC 

AGCTTCCCAGACAGACTATAGACTATAAATATGTCTCCATCTGCCTTACCAAGTGTTTTCTT 

ACTACAATGCTGAATGACTGGAAAGAAGAACTGATATGGCTAGTTCAGCTAGCTGGTACAGA 

TAATTCAAAACTGCTGTTGGTTTTAATTTTGTAACCTGTGGCCTGATCTGTAAATAAAACTT 

ACATTTTTC 
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FIGURE 105 



MS FRKVN 1 1 ILVLAVALFLL VLHHNFLS LS SLLRNE VTDSG I VG PQP I DFVPNALRHAVDGR 
QEE I PWIAASEDRLGGAI AAINS IQHNTRSNVI FY I VTLNNTADHLRSWLNSDSLKS IRYK 
IWFDPKLLEGKVKEDPDQGESMKPLTFARFYLPILVPSAKKAIYMDDDVIVQGDILALYNT 
ALKPGHAAAFSEDCDSASTKVVIRGAGNQYNYIGYLDYKKERIRKLSMKASTCSFNPGVFVA 
NLTEWKRQNITNQLEKWMKL3WEEGLYSRTLAGSITTPPLLIVFYQQHSTIDPMWNVRHLGS 
SAGKRYSPQFVKAAKLLHWNGHLKPWGRTASYTDVWEKWYIPDPTGKFNLIRRYTEISNIK 
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FIGURE 106 

TGGTTTTTGCCCCATAAATTCCCTCAGCTTGAGCAGTTTGTTAAGGAATGAGGTTACAGATT 
CAGGAATTNTAGGNCCTCAACCTNTAGANTTTGTCCCAAATGTTCTCCGACATGCAGTAGAT 
GGGAGACAAGAGGAGATTCCTGTGGTCATCGCTGCATNTGAAGACAGGCTTGGGGGGGCCAT 
TGCAGCTATAAACAGCATTCAGCACAACACTCGNTCCAATGTGATTTTCTACATTGTTACTC 
TCAACAATACAGCAGACCATNTCCGGTCCTGGNTCAACAGTGATTCCCTGAAAAGCATCAGA 
TACAAAATTGTGAATTTTGACCCTAAACTTTTGGAAGGAAT^AGTAAAGGAGGATCCTGACCA 
GGGGGAATCCATGAAACCTTTAACCTTTGCAAGGTTCTACTTGCCAATTCTGGTTCCCAGCG 
CAAAGAAGGCCATATACATGGATGATGATGTAATTGTGCAAGGTGATATTCTTGCCCTTTAC 
AATACAGCACTGAAGCCAGGACATGCAGCTGCATTTTCAGAAGATTGTGATTCAGCCTCTAC 
TAAAGTTGTCATCCGTGGAGCAGGAAA 
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FIGURE 107 

CGACGCTCTAGCGGTTACCGCTGCGGGCTGGCTGGGCGTAGTGGGGCTGCGCGGCTGCCACG 
GAGCTAGAGGGCAAGTGTGCTCGGCCCAGCGTGCAGGGAACGCGGGCGGCCAGACAACGGGC 
TGGGCTCCGGGGCCTGCGGCGCGGGCGCTGAGCTGGCAGGGCGGGTCGGGGCGCGGGCTGCA 
TCCGCATCTCCTCCATCGCCTGCAGTAAGGGCGGCCGCGGCGAGCCTTTGAGGGGAACGACT 
TGTCGGAGCCCTAACCAGGGGTGTCTCTGAGCCTGGTGGGATCCCCGGAGCGTCACATCACT 
TTCCGATCACTTCAAAGTGGTTAA/^AACTAATATTTATATGACAGAAGAAAAAGATGTCATT 
CCGTAAAGTAAACATCATCATCTTGGTCCTGGGCTGTTGCTCTCTTCTTACTGGTTTTGCAC 
CATAACTTCCTCAGCTTGAGGCAGTTTGTTAAGGAATGAGGTTACAGATTCAGGAATTGTAG 
GGCCTCAACCTATAGGACTTTGTCCCAAATGCTCTCCGACATGCAGTAGATGGGAGACAAGA 
GGAGATTCCTGTGGTCATCGCTGCATCTGAAGACAGGCTTGGGGGGGCCATTGCAGCTATAA 
ACAGCATTCAGCACAACACTCGCTCCAATGTGATTTTCTACATTGTTACTCTCAACAATACA 
GCAGACCATCTCCGGTCCTGGGCTCAACAGTGATTCCCTGAAAAGCATCAGATACAAAATTG 
TCAATTTTGACCCTAAACTTTTGGAAGGAAAAGTAAAGGAGGATCCTGACCAGGGGGAATCC 
ATGAAACCTTTAACCTTTGCAAGGTTCTACTTGCCAATTCTGGGTTCCCAGCGCAAAGAAGG 
CCATATACATGGATGATGATGTAATTGTGCAAGGTGATATTCTTGCCCTTTACAATACAGCA 
CTGAAGCCAGGACATGCAGCTGCATTTTCAGAAGATTGTGATTCAGCCTCTACTAAAGTTGT 
CATCCGTGGAGCAGGAAACCAGTACAATTACATTGGCTATCTTGACTATAAAAAGGAAAGAA 
TTCGTAAGCTTTCCATGAAAGCCAGCACTTGCTCATTTAATCCTGGAGTTTTTGTTGCAAAC 
CTGACGGAATGGAAACGACAGAATATAACTAACCAACTGGAAAAATGGATGAAACTCAATGT 
AGAAGAGGGACTGTATAGCAGAACCCTGGCTGGTAGCATCACAACACCTCCTCTGCTTATCG 
TATTTTATCAACAGCACTCTACCATCGATCCTATGTGGAATGTCCGCCACCTTGGTTCCAGT 
GCTGGAAAACGATATTCACCTCAGTTTGTAAAGGCTGCCAAGTTACTCCATTGGAATGGACA 
TTTGAAGCCATGGGGAAGGACTGCTTCATATACTGATGTTTGGGGAAAAATGGTATATTCCA 
GACCCAACAGGCAAATTCAACCTAATCCGAAGATATACCGAGATCTCAAACATAAAGTGAAA 
CAGAATTTGAACTGTAAGCAAGCATTTCTCAGGAAGTCCTGGAAGATAGCATGCGTGGGAAG 
TAACAGTTGCTAGGCTTCAATGCCTATCGGTAGCAAGCCATGGAAAAAGATGTGTCAGCTAG 
GTAAAGATGACAAACTGCCCTGTCTGGCAGTCAGCTTCCCAGACAGACTATAGACTATAAAT 
ATGTCTCCATCTGCCTTACCAAGTGTTTTCTTACTACAATGCTGAATGACTGGAAAGAAGAA 
CTGATATGGCTAGTTCAGCTAGCTGGTACAGATAATTCAAAACTGCTGTTGGTTTTAATTTT 
GTAACCTGTGGCCTGATCTGTAAATAAAACTTACATTTTTCAATAGGTAAAAAAAAAAAAAA 
AAAAAA 
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FTGTJRE 108 

CTGCAGGTAGACATCTCCACTGCCCAGGAATCACTGAGCGTGCAGACAGCACAGCCTCCTCT 
GAAGGCCGGCCATACCAGAGTCCTGCCTCGGCATGGGCCTCACCATTGAGGCAGCTCCACTG 
TCTGTGCTGGTCTGAGGGTGCTGCCTGTCATGGGGGCAGCCATCTCCCAGGGGGCCCTCATC 
GCCATCGTCTGCAACGGTCTCGTGGGCTTCTTGCTGCTGCTGCTCTGGGTCATCCTCTGCTG 
GGCCTGCCATTCTCGTCTGCCGACGTTGACTCTCTCTCTGAATCCAGTCCCAACTCCAGCCC 
TGGCCCCTGTCCTGAGAAGGCCCCACCACCCCAGAAGCCCAGCCATGAAGGCAGCTACCTGC 
TGCAGCCCTGAAGGCCCCTGGCCTAGCCTGGAGCCCAGGACCTAAGTCCACCTCACCTAGAG 
CCTGGAATTAGGATCCCAGAGTTCAGCCAGCCTGGGGTCCAGAACTCAAGAGTCCGCCTGCT 
TGGAGCTGGACCCAGCGGCCCAGAGTCTAGCCAGCTTGGCTCCAATAGGAGCTCAGTGGCCC 
TAAGGAGATGGGCCTGGGGTGGGGGCTTATGAGTTGGTGCTAGAGCCAGGGCCATCTGGACT 
ATGCTCCATCCCAAGGGCCAAGGGTCAGGGGCCGGGTCCACTCTTTCCCTAGGCTGAGCACC 
TCTAGGCCCTCTAGGTTGGGGAAGCAAACTGGAACCCATGGCAATAATAGGAGGGTGTCCAG 
GCTGGGCCCCTCCCCTGGTCCTCCCAGTGTTTGCTGGATAATAAATGGAACTATGGCTCTAA 
AAAAAAAAAAAAAAAAA 
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FIGURE 109 



MGAAISQGALIAIVCNGLVGFLLLLLWVILCWACHSRLPTLTLSLNPVPTPALAPVLRRPHH 
PRSPAMKAATCCSPEGPWPSLEPRT 
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FIGURE 110 

GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCA 

GTTCCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTA 

CTCCCTATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAA 

TCATGTCGGGAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCC 

ATGATGTTTACCTTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTAT 

TTTGGGATTGTTGTTTGTCTGCGGTGTTTTATGGTGGCTGTATTATGACTATACCAACGACC 

TCAGCATAGAATTGGACACAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTA 

TCCACAGGCATCACGGCAGTGCTGCTCGTCTTGATTTTTGTTCTCAGAAAGAGAATAAAATT 

GACAGTTGAGCTTTTCCAAATCACAAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCC 

AGCCACTGTGGACATTTGCCATCCTCATTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTG 

AGCCTGGGAACTGCAGGAGCTGCCCAGGTTATGGAAGGCGGCCAAGTGGAATATAAGCCCCT 

TTCGGGCATTCGGTACATGTGGTCGTACCATTTAATTGGCCTCATCTGGACTAGTGAATTCA 

TCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAGTGGTTACTTGTTATTTCAACAGAAGT 

AAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTCTCCATTCTCTTCTTCTACCATCA 

AGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAGGATTCCGAGAATCATTGTCA 

TGTACATGCAAAACGCACTGAAAGAACAGCAGCATGGTGCATTGTCCAGGTACCTGTTCCGA 

TGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTCAACCAGAATGCATA 

TACTACAACTGCTATTAATGGGACAGATTTCTGTACATCAGCAAAAGATGCATTCAAAATCT 

TGTCCAAGAACTCAAGTCACTTTACATCTATTAACTGCTTTGGAGACTTCATAATTTTTCTA 

GGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTACAATCG 

GGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGCCC 

ATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGAT 

CTGGAAACAAATGATGGATCGTCAGAAAAGCCCTACTTTATGGATCAAGAATTTCTGAGTTT 

CGTAAAAAGGAGCAACAAATTAAACAATGCAAGGGCACAGCAGGACAAGCACTCATTAAGGA 

ATGAGGAGGGAACAGAACTCCAGGCCATTGTGAGATAGATACCCATTTAGGTATCTGTACCT 

GGAAAACATTTCCTTCTAAGAGCCATTTACAGAATAGAAGATGAGACCACTAGAGAAAAGTT 

AGTGAATTTTTTTTTAAAAGACCTAATAAACCCTATTCTTCCTCAAAA 
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FIGURE 111 

MS GRDT I LGLC I LALALSLAMMFTFRF I TTLLVH I F I S LV I LGLLF VCGVLWWL YYD YTNDL 
S I ELDTERENMKCVLGFAI VSTGI TAVLLVL I FVLRKR I KLTVELFQ I TNKAI S SAPFLLFQ 
PLWTFAILIFFWVLWVAVLLSLGTAGAAQVMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFI 
LACQQMTIAGAVVTCYFNRSKNDPPDHPILSSLSILFFYHQGTVVKGSFLISVW 
YMQNALKEQQHGALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKIL 
SKNSSHFTSINCFGDFIIFLGKVLWCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAH 
SFLSVFETVLDALFLCFAVDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRN 
EEGTELQA I VR 
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FIGURE 112 



PCT/US99/12252 



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTT 
AGAATAATTTGTATGGGATTTGTGATGCAGGAAAGCCTAAGGGAAAAAGAATATTCATTCTG 
TGTGGTGAAAATTTTTTGAAAAAAAAATTGCCTTCTTCAAACAAGGGTGTCATTCTGATATT 
T ATGA GGACTGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGG 
TGACTGGAGTACATTCAAACAAAGAAACGGCAAAGAAGATTAAAAGGCCCAAGTTCACTGTG 
CCTCAGATCAACTGCGATGTCAAAGCCGGAAAGATCATCGATCCTGAGTTCATTGTGAAATG 
TCCAGCAGGATGCCAAGACCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACT 
CCAGTGTGTGTGGCGCTGCCGTACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTT 
GTTCGGAAGGTTGCTGGACAGTCTGGTTACAAAGGGAGTTATTCCAACGGTGTCCAATCGTT 
ATCCCTACCACGATGGAGAGAATCCTTTATCGTCTTAGAAAGTAAACCCAAAAAGGGTGTAA 
CCTACCCATCAGCTCTTACATACTCATCATCGAAAAGTCCAGCTGCCCAAGCAGGTGAGACC 
ACAAAAGCCTATCAGAGGCCACCTATTCCAGGGACAACTGCACAGCCGGTCACTCTGATGCA 
GCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCACCACCTTGCCAAGGCCATCCCCTTCTG 
CTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGGGCCACAGGAGCCAGGAGATGGAT 
CTCTGGTCCACTGCCACCTACACAAGCAGCCAAAACAGGCCCAGAGCTGATCCAGGTATCCA 
AAGGCAAGATCCTTCAGGAGCTGCCTTCCAGAAACCTGTTGGAGCGGATGTCAGCCTGGGAC 
TTGTTCCAAAAGAAGAATTGAGCACACAGTCTTTGGAGCCAGTATCCCTGGGAGATCCAAAC 
TGCAAAATTGACTTGTCGTTTTTAATTGATGGGAGCACCAGCATTGGCAAACGGCGATTCCG 
AATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGCCGGTCCAC 
TGATGGGTGTTGTCCAGTATGGAGACAACCCTGCTACTCACTTTAACCTCAAGACACACACG 
AATTCTCGAGATCTGAAGACAGCCATAGAGAAAATTACTCAGAGAGGAGGACTTTCTAATGT 
AGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCG 
GGGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCT 
TCAAGACTTGCGAGAGAGTCAGGAATCAACATTTTCTTCATCACCATTGAAGGTGCTGCTGA 
AAATGAGAAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACG 
GCTTCTACTCGCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTG 
AAGCGGGTCTGCGACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACAT 
TGGCTTCGTCATCGACGGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGT 
TTGTGACCAACCTCACCAAAGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTG 
CAGTACACCTACGAACAGCGGCTGGAGTTTGGGTTCGACAAGTACAGCAGCAAGCCTGACAT 
CCTCAACGCCATCAAGAGGGTGGGCTACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCA 
ACTTCGCCCTGGAGCAGCTCTTCAAGAAGTCCAAGCCCAACAAGAGGAAGTTAATGATCCTC 
ATCACCGACGGGAGGTCCTACGACGACGTCCGGATCCCAGCCATGGCTGCCCATCTGAAGGG 
AGTGATCACCTATGCGATAGGCGTTGCCTGGGCTGCCCAAGAGGAGCTAGAAGTCATTGCCA 
CTCACCCCGCCAGAGACCACTCCTTCTTTGTGGACGAGTTTGACAACCTCCATCAGTATGTC 
CCCAGGATCATCCAGAACATTTGTACAGAGTTCAACTCACAGCCTCGGAACTGAATTCAGAG 
CAGGCAGAGCACCAGCAAGTGCTGCTTTACTAACTGACGTGTTGGACCACCCCACCGCTTAA 
TGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAACAAATGTCTTGTTATTA 
TTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGATGATCACAAACGT 
ATAGAATGAGCCAAAAGGCTACATCATGTTGAGGGTGCTGGAGATTTTACATTTTGACAATT 
GTTTTCAAAATAAATGTTCGGAATACAGTGCAGCCCTTACGACAGGCTTACGTAGAGCTTTT 
GTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCATTTTT 
GTCATGACAATGTAGGAATTGCTGAATTAAATGTTTAGAAGGATGAAAAATAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 113 

MRTWLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKC 

PAGCQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSL 

SLPRWRESFIVLESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQ 

LLAVTVAVATPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQ 

RQDPSGAAFQKPVGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFR 

IQKQLLADVAQALDIGPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNV 

GRAISFVTKNFFSKANGNRSGAPNVVVVMVDGWPTDKVEEASRLARESGINIFF 

NEKQYVVEPNFANKAVCRTNGFYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADI 

GFVIDGSSSVGTGNFRTVLQFVTNLTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDI 

LNAIKRVGYWSGGTSTGAAINFALEQLFKKSKPNKRKLMILITDGRSYDDVRIPAMAAHLKG 

VITYAIGVAWAAQEELEVIATHPARDHSFFVDEFDNLHQYVPRIIQNICTEFNSQPRN 
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FIGURE 114 

CAGGATGAACTGGTTGCAGTGGCTGCTGCTGCTGCGGGGGCGCTGAGAGGACACGAGCTCTA 
TGCCTTTCCGGCTGCTCATCCCGCTCGGCCTCCTGTGCGCGCTGCTGCCTCAGCACCATGGT 
GCGCCAGGTCCCGACGGCTCCGCGCCAGATCCCGCCCACTACAGTTTTTCTCTGACTCTAAT 
TGATGCACTGGACACCTTGCTGATTTTGGGGAATGTCTCAGAATTCCAAAGAGTGGTTGAAG 
TGCTCCAGGACAGCGTGGACTTTGATATTGATGTGAACGCCTCTGTGTTTGAAACAAACATT 
CGAGTGGTAGGAGGACTCCTGTCTGCTCATCTGCTCTCCAAGAAGGCTGGGGTGGAAGTAGA 
GGCTGGATGGCCCTGTTCCGGGCCTCTCCTGAGAATGGCTGAGGAGGCGGCCCGAAAACTCC 
TCCCAGCCTTTCAGACCCCCACTGGCATGCCATATGGAACAGTGAACTTACTTCATGGCGTG 
AACCCAGGAGAGACCCCTGTCACCTGTACGGCAGGGATTGGGACCTTCATTGTTGAATTTGC 
CACCCTGAGCAGCCTCACTGGTGACCCGGTGTTCGAAGATGTGGCCAGAGTGGCTTTGATGC 
GCCTCTGGGAGAGCCGGTCAGATATCGGGCTGGTCGGCAACCACATTGATGTGCTCACTGGC 
AAGTGGGTGGCCCAGGACGCAGGCATCGGGGCTGGCGTGGACTCCTACTTTGAGTACTTGGT 
GAAAGGAGCCATCCTGCTTCAGGATAAGAAGCTCATGGCCATGTTCCTAGAGTATAACAAAG 
CCATCCGGAACTACACCCGCTTCGATGACTGGTACCTGTGGGTTCAGATGTACAAGGGGACT 
GTGTCCATGCCAGTCTTCCAGTCCTTGGAGGCCTACTGGCCTGGTCTTCAGAGCCTCATTGG 
AGACATTGACAATGCCATGAGGACCTTCCTCAACTACTACACTGTATGGAAGCAGTTTGGGG 
GGCTCCCGGAATTCTACAACATTCCTCAGGGATACACAGTGGAGAAGCGAGAGGGCTACCCA 
CTTCGGCCAGAACTTATTGAAAGCGCAATGTACCTCTACCGTGCCACGGGGGATCCCACCCT 
CCTAGAACTCGGAAGAGATGCTGTGGAATCCATTGAAAAAATCAGCAAGGTGGAGTGCGGAT 
TTGCAACAATCAAAGATCTGCGAGACCACAAGCTGGAGAACCGCATGGAGTCGTTCTTCCTG 
GCCGAGACTGTGAAATACCTCTACCTCCTGTTTGACCCAACCAACTTCATCCACAACAATGG 
GTCCACCTTCGACGCGGTGATCACCCCCTATGGGGAGTGCATCCTGGGGGCTGGGGGGTACA 
TCTTCAACACAGAAGCTCACCCCATCGACCTTGCCGCCCTGCACTGCTGCCAGAGGCTGAAG 
GAAGAGCAGTGGGAGGTGGAGGACTTGATGAGGGAATTCTACTCTCTCAAACGGAGCAGGTC 
GAAATTTCAGAAAAACACTGTTAGTTCGGGGCCATGGGAACCTCCAGCAAGGCCAGGAACAC 
TCTTCTCACCAGAAAACCATGACCAGGCAAGGGAGAGGAAGCCTGCCAAACAGAAGGTCCCA 
CTTCTCAGCTGCCCCAGTCAGCCCTTCACCTCCAAGTTGGCATTACTGGGACAGGTTTTCCT 
AGACTCCTCATAACCACTGGATAATTTTTTTATTTTTATTTTTTTGAGGCTAAACTATAATA 
AATTGCTTTTGGCTATCATAAAA 
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FIGURE 115 

MPFRLL I PLGLLCALLPQHHGAPGPDGS APDPAHYS FSLTL I DALDTLLI LGNVS E FQR WE 

VLQDSVDFDIDVNASVFETNIRWGGLLSAHLLSKKAGVEVEAGWPCSGPLLRMAEEAARKL 

LPAFQTPTGMPYGTVNLLHGVNPGETPVTCTAGIGTFIVEFATLSSLTGDPVFEDVARVALM 

RLWESRSDIGLVGNHIDVLTGKWAQDAGIGAGVDSYFEYLVKGAILLQDKKLMAMFLEYNK 

AIRNYTRFDDWYLWQMYKGTVSMPVFQSLEAYWPGLQSLIGDIDNAMRTFIJSr^ 

GLPEFYNIPQGYTVEKREGYPLRPELIESAMYLYRATGDPTLLELGRDAVESIEKISKVECG 

FATIKDLRDHKLDNRMESFFLAETVKYLYLLFDPTNFIHNNGSTFDAVITPYGECILGAGGY 

IFNTEAHPIDIJ^ALHCCQRLKEEQWEVEDLMREFYSLKRSRSKFQKNTVSSGPWEPPARPGT 

LFSPENHDQARERKPAKQKVPLLSCPSQPFTSKLALLGQVFLDSS 
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FIGURE 116 

AAAGTTACATTTTCTCTGGAACTCTCCTAGGCCACTCCCTGCTGATGCAACATCTGGGTTTG 
GGCAGAAAGGAGGGTGCTTCGGAGCCCGCCCTTTCTGAGCTTCCTGGGCCGGCTCTAGAACA 
ATTCAGGCTTCGCTGCGACTCAGACCTCAGCTCCAACATATGCATTCTGAAGAAAGATGGCT 
GAGATGGACAGAATGCTTTATTTTGGAAAGAAACAATGTTCTAGGTCAAACTGAGTCTACCA 
AATGCAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCTTTTCATGTGGTTTTTCT 
ACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGCCTGCCCCTCAGAACCTC 
TCTGTACTCTCAACCAACATGAAGCATCTCTTGATGTGGAGCCCAGTGATCGCGCCTGGAGA 
AACAGTGTACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCTGTACACGAGCCACATCT 
GGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTGATGTCACTGATGACATC 
ACGGCCACTGTGCCATACAACCTTCGTGTCAGGGCCACATTGGGCTCACAGACCTCAGCCTG 
GAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATCCTTACCCGACCTGGGATGGAGA 
TCACCAAAGATGGCTTCCACCTGGTTATTGAGCTGGAGGACCTGGGGCCCCAGTTTGAGTTC 
CTTGTGGCCTACTGGAGGAGGGAGCCTGGTGCCGAGGAACATGTCAAAATGGTGAGGAGTGG 
GGGTATTCCAGTGCACCTAGAAACCATGGAGCCAGGGGCTGCATACTGTGTGAAGGCCCAGA 
CATTCGTGAAGGCCATTGGGAGGTACAGCGCCTTCAGCCAGACAGAATGTGTGGAGGTGCAA 
GGAGAGGCCATTCCCCTGGTACTGGCCCTGTTTGCCTTTGTTGGCTTCATGCTGATCCTTGT 
GGTCGTGCCACTGTTCGTCTGGAAAATGGGCCGGCTGCTCCAGTACTCCTGTTGCCCCGTGG 
TGGTCCTCCCAGACACCTTGAAAATAACCAATTCACCCCAGAAGTTAATCAGCTGCAGAAGG 
GAGGAGGTGGATGCCTGTGCCACGGCTGTGATGTCTCCTGAGGAACTCCTCAGGGCCTGGAT 
CTCATAGGTTTGCGGAAGGGCCCAGGTGAAGCCGAGAACCTGGTCTGCATGACATGGAAACC 
ATGAGGGGACAAGTTGTGTTTCTGTTTTCCGCCACGGACAAGGGATGAGAGAAGTAGGAAGA 
GCCTGTTGTCTACAAGTCTAGAAGCAACCATCAGAGGCAGGGTGGTTTGTCTAACAGAACAC 
TGACTGAGGCTTAGGGGATGTGACCTCTAGACTGGGGGCTGCCACTTGCTGGCTGAGCAACC 
CTGGGAAAAGTGACTTCATCCCTTCGGTCCTAAGTTTTCTCATCTGTAATGGGGGAATTACC 
TACACACCTGCTAAACACACACACACAGAGTCTCTCTCTATATATACACACGTACACATAAA 
TACACCCAGCACTTGCAAGGCTAGAGGGAAACTGGTGACACTCTACAGTCTGACTGATTCAG 
TGTTTCTGGAGAGCAGGACATAAATGTATGATGAGAATGATCAAGGACTCTACACACTGGGT 
GGCTTGGAGAGCCCACTTTCCCAGAATAATCCTTGAGAGAAAAGGAATCATGGGAGCAATGG 
TGTTGAGTTCACTTCAAGCCCAATGCCGGTGCAGAGGGGAATGGCTTAGCGAGCTCTACAGT 
AGGTGACCTGGAGGAAGGTCACAGCCACACTGAAAATGGGATGTGCATGAACACGGAGGATC 
CATGAACTACTGTAAAGTGTTGACAGTGTGTGCACACTGCAGACAGCAGGTGAAATGTATGT 
GTGCAATGCGACGAGAATGCAGAAGTCAGTAACATGTGCATGTTTGTTGTGCTCCTTTTTTC 
TGTTGGTAAAGTACAGAATTCAGCAAATAAAAAGGGCCACCCTGGCCAAAAGCGGTAAAAAA 
AAAAAAAAAA 
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FIGURE 117 

MQTFTIWLEEIOTSLFMWFFYAm 

TVYYSVEYQGEYESLYTSHIWIPSSWCSLTEGPECDVTDDITATVPYNLRVRATLGSQTSAW 
S ILKHPFNRNST I LTRPGME I TKDGFHLVI ELEDLGPQFE FLVAYWRRE PGAEEHVKMVRSG 
GI PVHLETMEPGAAYCVKAQTFVKAIGRYSAFSQTECVEVQGEAI PLVLALFAFVGFMLILV 
WPLFWKMGRLLQYSCCPVVVLPDTLK^ 

Important features: 
Signal peptide: 

amino acids 1-29 

Transmembrane domain: 

amino acids 230-255 

N-glycosylation sites. 

amino acids 40-43 and 134-137 

Tissue factor proteins homology. 

amino acids 92-119 

Integrins alpha chain protein homology. 

amino acids 232-262 
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TCCTGCTGATGCACATCTGGGTTTGGCAAAAGGAGGTTGCTTCGAGCCGCCCTTTCTAGCTT 
CCTGGCCGGCTCTAGAACAATTCAGGCTTCGCTGCGACTAGACCTCAGCTCCAACATATGCA 
TTCTGAAGAAAGATGGCTGAGATGACAGAATGCTTTATTTTGGA7UVGAAACAATGTTCTAGG 
TCAAACTGAGTCTACCAAATGCAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCT 
TTTCATGTGGTTTTTCTACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGC 
CTGCCCCTCAGAACCTCTCTGTACTCTCAACCAACATGAAGCATCTCTTGATGTGGAGCCCA 
GTGATCGCGCCTGGAGAAACAGTGTACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCT 
GTACACGAGCCACATCTGGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTG 
ATGTCACTGATGACATCACGGCCACTGTGCCATACAACCTTTGTGTCAGGGCCACATTGGGC 
TCACAGACCTCAGCCTGGAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATCCTTAC 
CCGACCTGGGATGGAGATCACCAAAGATGGCTTNCACCTGGTTATTGAGCTGGAGGACCTGG 
GGCCCCAGTTTGAGTTCCTTGTGGCCTANTGGAGGAGGGGCGAACCCCTTGCGGCGCAAGGG 
GTTNGCGAACCCCTTGCGGCCGCTGGGGTATCTCTCGAGAAAAGAGAGGCCCAATATGACCCAC 
ATACTCAATATGGACGAANTGCTATTGTCCACCTGTTTGAGTGGCGCTGGGTTGAT 
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FIGURE 119 

CGGACGCGTGGGCCGCCACCTCCGGAACAAGCCATGGTGGCGGCGACGGTGGCAGCGGCGTG 
GCTGCTCCTGTGGGCTGCGGCCTGCGCGCAGCAGGAGCAGGACTTCTACGACTTCAAGGCGG 
TCAACATCCGGGGCAAACTGGTGTCGCTGGAGAAGTACCGCGGATCGGTGTCCCTGGTGGTG 
AATGTGGCCAGCGAGTGCGGCTTCACAGACCAGCACTACCGAGCCCTGCAGCAGCTGCAGCG 
AGACCTGGGCCCCCACCACTTTAACGTGCTCGCCTTCCCCTGCAACCAGTTTGGCCAACAGG 
AGCCTGACAGCAACAAGGAGATTGAGAGCTTTGCCCGCCGCACCTACAGTGTCTCATTCCCC 
ATGTTTAGCAAGATTGCAGTCACCGGTACTGGTGCCCATCCTGCCTTCAAGTACCTGGCCCA 
GACTTCTGGGAAGGAGCCCACCTGGAACTTCTGGAAGTACCTAGTAGCCCCAGATGGAAAGG 
TGGTAGGGGCTTGGGACCCAACTGTGTCAGTGGAGGAGGTCAGACCCCAGATCACAGCGCTC 
GTGAGGAAGCTCATCCTACTGAAGCGAGAAGACTTATAACCACCGCGTCTCCTCCTCCACCA 
CCTCATCCCGCCCACCTGTGTGGGGCTGACCAATGCAAACTCAAATGGTGCTTCAAAGGGAG 
AGACCCACTGACTCTCCTTCCTTTACTCTTATGCCATTGGTCCCATCATTCTTGTGGGGGAA 
AAATTCTAGTATTTTGATTATTTGAATCTTACAGCAACAAATAGGAACTCCTGGCCAATGAG 
AGCTCTTGACCAGTGAATCACCAGCCGATACGAACGTCTTGCCAACAAAAATGTGTGGCAAA 
TAGAAGTATATCAAGCAATAATCTCCCACCCAAGGCTTCTGTAAACTGGGACCAATGATTAC 
CTCATAGGGCTGTTGTGAGGATTAGGATGAAATACCTGTGAAAGTGCCTAGGCAGTGCCAGC 
CAAATAGGAGGCATTCAATGAACATTTTTTGCATATAAACCAAAAAATAACTTGTTATCAAT 
AAAAACTTGCATCCAACATGAATTTCCAGCCGATGATAATCCAGGCCAAAGGTTTAGTTGTT 
GTTATTTCCTCTGTATTATTTTCTTCATTACAAAAGAAATGCAAGTTCATTGTAACAATCCA 
AACAATACCTCACGATATAAAATAAAAATGAAAGTATCCTCCTCAAAAA 
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FIGURE 120 

MVAATVAAAWLLLWAAACAQQEQDFYDFKAVNIRGKLVSLEKYRGSVSLVVNVASECGFTDQ 
HYRALQQLQRDLGPHHFNVIJ^PCNQFGQQEPDSl^EIESFARRTYSVSFPMFSKIAV 
AHPAFKYLAQTSGKE PTWNFWKYLVAPDGKWGAWDPTVS VEEVRPQ ITALVRKLI LLKREDL 
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CGGACGCGTGGGCGGGCCGGGACGCAGGGCAAAGCGAGCCATGGCTGTCTACGTCGGGATGC 
TGCGCCTGGGGAGGCTGTGCGCCGGGAGCTCGGGGGTGCTGGGGGCCCGGGCCGCCCTCTCT 
CGGAGTTGGCAGGAAGCCAGGTTGCAGGGTGTCCGCTTCCTCAGTTCCAGAGAGGTGGATCG 
CATGGTCTCCACGCCCATCGGAGGCCTCAGCTACGTTCAGGGGTGCACCAAAAAGCATCTTA 
ACAGCAAGACTGTGGGCCAGTGCCTGGAGACCACAGCACAGAGGGTCCCAGAACGAGAGGCC 
TTGGTCGTCCTCCATGAAGACGTCAGGTTGACCTTTGCCCAACTCAAGGAGGAGGTGGACAA 
AGCTGCTTCTGGCCTCCTGAGCATTGGCCTCTGCAAAGGTGACCGGCTGGGCATGTGGGGAC 
CTAACTCCTATGCATGGGTGCTCATGCAGTTGGCCACCGCCCAGGCGGGCATCATTCTGGTG 
TCTGTGAACCCAGCCTACCAGGCTATGGAACTGGAGTATGTCCTCAAGAAGGTGGGCTGCAA 
GGCCCTTGTGTTCCCCAAGCAATTCAAGACCCAGCAATACTACAACGTCCTGAAGCAGATCT 
GTCCAGAAGTGGAGAATGCCCAGCCAGGGGCCTTGAAGAGTCAGAGGCTCCCAGATCTGACC 
ACAGTCATCTCGGTGGATGCCCCTTTGCCGGGGACCCTGCTCCTGGATGAAGTGGTGGCGGC 
TGGCAGCACACGGCAGCATCTGGACCAGCTCCAATACAACCAGCAGTTCCTGTCCTGCCATG 
ACCCCATCAACATCCAGTTCACCTCGGGGACAACAGGCAGCCCCAAGGGGGCCACCCTCTCC 
CACTACAACATTGTCAACAACTCCAACATTTTAGGAGAGCGCCTGAAACTGCATGAGAAGAC 
ACCAGAGCAGTTGCGGATGATCCTGCCCAACCCCCTGTACCATTGCCTGGGTTCCGTGGCAG 
GCACAATGATGTGTCTGATGTACGGTGCCACCCTCATCCTGGCCTCTCCCATCTTCAATGGC 
AAGAAGGCACTGGAGGCCATCAGCAGAGAGAGAGGCACCTTCCTGTATGGTACCCCCACGAT 
GTTCGTGGACATTCTGAACCAGCCAGACTTCTCCAGTTATGACATCTCGACCATGTGTGGAG 
GTGTCATTGCTGGGTCCCCTGCACCTCCAGAGTTGATCCGAGCCATCATCAACAAGATAAAT 
ATGAAGGACCTGGTGGTTGCTTATGGAACCACAGAGAACAGTCCCGTGACATTCGCGCACTT 
CCCTGAGGACACTGTGGAGCAGAAGGCAGAAAGCGTGGGCAGAATTATGCCTCACACGGAGG 
CCCGGATCATGAACATGGAGGCAGGGACGCTGGCAAAGCTGAACACGCCCGGGGAGCTGTGC 
ATCCGAGGGTACTGCGTCATGCTGGGCTACTGGGGTGAGCCTCAGAAGACAGAGGAAGCAGT 
GGATCAGGACAAGTGGTATTGGACAGGAGATGTCGCCACAATGAATGAGCAGGGCTTCTGCA 
AGATCGTGGGCCGCTCTAAGGATATGATCATCCGGGGTGGTGAGAACATCTACCCCGCAGAG 
CTCGAGGACTTCTTTCACACACACCCGAAGGTGCAGGAAGTGCAGGTGGTGGGAGTGAAGGA 
CGATCGGATGGGGGAAGAGATTTGTGCCTGCATTCGGCTGAAGGACGGGGAGGAGACCACGG 
TGGAGGAGATAAAAGCTTTCTGCAAAGGGAAGATCTCTCACTTCAAGATTCCGAAGTACATC 
GTGTTTGTCACAAACTACCCCCTCACCATTTCAGGAAAGATCCAGAAATTCAAACTTCGAGA 
GCAGATGGAACGACATCTAAATCTGTGAATAAAGCAGCAGGCCTGTCCTGGCCGGTTGGCTT 
GACTCTCTCCTGTCAGAATGCAACCTGGCTTTATGCACCTAGATGTCCCCAGCACCCAGTTC 
TGAGCCAGGCACATCAAATGTCAAGGAATTGACTGAACGAACTAAGAGCTCCTGGATGGGTC 
CGGGAACTCGCCTGGGCACAAGGTGCCAAAAGGCAGGCAGCCTGCCCAGGCCCTCCCTCCTG 
TCCATCCCCCACATTCCCCTGTCTGTCCTTGTGATTTGGCATAAAGAGCTTCTGTTTTCTTT 
GAAAAAAAAAAAAAAAA 
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FIGURE 122 

MAVWGMLRLGRLCAGSSGVLGARAALSRSWQEARLQGVRFLSSREVDRMVSTPIGGLSYVQ 
GCTKKHLNSKTVGQCLETTAQRVPEREALWLHEDVRLTFAQLKEEVDKAASGLLSIGLCKG 
DRLGMWGPNS YAWVLMQLATAQAG 1 1 LVSVNPAYQAMELE YVLKKVGCKALVFPKQFKTQQY 
YNVLKQICPEVENAQPGALKSQRLPDLTTVISVDAPLPGTLLLDEWAAGSTRQHLDQLQYN 
QQFLSCHDPINIQFTSGTTGSPKGATLSHYNIVNNSNILGERLKLHEKTPEQLRMILPNPLY 
HCLGSVAGTMMCLMYGATLILASPIFNGKKALEAISRERGTFLYGTPTMFVDILNQPDFSSY 
DISTMCGGVIAGSPAPPELIRAIINKINMKDLWAYGTTENSPVTFAHFPEDTVEQKAESVG 
RIMPHTEARII^MEAGTLAKLNTPGELCIRGYCVMLGYWGEPQKTEEAVDQDKWYWTGDVAT 
MNEQGFCKIVGRSKDMIIRGGENIYPAELEDFFHTHPKVQEVQWGVKDDRMGEEICACIRL 
KDGEETTVEEIKAFCKGKISHFKIPKYIVFVTNYPLTISGKIQKFKLREQMERHLNL 
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FIGURE 123 

CAACTCCAACATTTTAGGAGAGCGCCTGAAACTGCATGAGAAGACACCAGAGCAGTTGCGGA 
TGATCCTGCCCAACCCCCTGTACCATTGCCTGGGTTCCGTGGCAGGCACAATGATGTGTCTG 
ATGTACGGTGCCACCCTCATCCTGGCCTCTCCCATCTTCAATGGCAAGAAGGCACTGGAGGC 
CATCAGCAGAGAGAGAGGCACCTTCCTGTATGGTACCCCCACGATGTTCGTGGACATTCTGA 
ACCAGCCAGACTTCTCCAGTTATGACATCTCGACCATGTGTGGAGGTGTCATTGCTGGGTCC 
CCTGCACCTCCAGAGTTGATCCGAGCCATCATCAACAAGATAAATATGAAGGACCTGGTGGT 
TGCTTATGGAACCACAGAGAACAGTCCCGTGACATTCGCGCACTTCCCTGAGGACACTGTGG 
AGCAGAAGGCAGAAAGCGTGGGCAGAATTATGCCTCACACGGAGGCGCGGATCATGAACATG 
GAGGCAGGGACGCTGGCAAAGCTGAACACGCCCGGGGAGCTGTGCATCCGAGGGTACTGCGT 
CATGCTGGGCTACTGGGGTGAGCCTCAGAAGACAGAGGAAGCAGTGGATCAGGACAAGTGGT 
ATTGGACAGGAGATGTCGCCAC 
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FIGURE 124 

GAGCAGGACGGAGCCATGGACCCCGCCAGGAAAGCAGGTGCCCAGGCCATGATCTGGACTGC 
AGGCTGGCTGCTGCTGCTGCTGCTTCGCGGAGGAGCGCAGGCCCTGGAGTGCTACAGCTGCG 
TGCAGAAAGCAGATGACGGATGCTCCCCGAACAAGATGAAGACAGTGAAGTGCGCGCCGGGC 
GTGGACGTCTGCACCGAGGCCGTGGGGGCGGTGGAGACCATCCACGGACAATTCTCGCTGGC 
AGTGCGGGGTTGCGGTTCGGGACTCCCCGGCAAGAATGACCGCGGCCTGGATCTTCACGGGC 
TTCTGGCGTTCATCCAGCTGCAGCAATGCGCTCAGGATCGCTGCAACGCCAAGCTCAACCTC 
ACCTCGCGGGCGCTCGACCCGGCAGGTAATGAGAGTGCATACCCGCCCAACGGCGTGGAGTG 
CTACAGCTGTGTGGGCCTGAGCCGGGAGGCGTGCCAGGGTACATCGCCGCCGGTCGTGAGCT 
GCTACAACGCCAGCGATCATGTCTACAAGGGCTGCTTCGACGGCAACGTCACCTTGACGGCA 
GCTAATGTGACTGTGTCCTTGCCTGTCCGGGGCTGTGTCCAGGATGAATTCTGCACTCGGGA 
TGGAGTAACAGGCCCAGGGTTCACGCTCAGTGGCTCCTGTTGCCAGGGGTCCCGCTGTAACT 
CTGACCTCCGCAACAAGACCTACTTCTCCCCTCGAATCCCACCCCTTGTCCGGCTGCCCCCT 
CCAGAGCCCACGACTGTGGCCTCAACCACATCTGTCACCACTTCTACCTCGGCCCCAGTGAG 
AC CCAC AT C C ACCACCAAAC CCATGC CAG CGC C AAC CAGT CAGACT C CG AGAC AGGG AGT AG 
AACACGAGGCCTCCCGGGATGAGGAGCCCAGGTTGACTGGAGGCGCCGCTGGCCACCAGGAC 
CGCAGCAATTCAGGGCAGTATCCTGCAAAAGGGGGGCCCCAGCAGCCCCATAATAAAGGCTG 
TGTGGCTCCCACAGCTGGATTGGCAGCCCTTCTGTTGGCCGTGGCTGCTGGTGTCCTACTGT 
GAGCTTCTCCACCTGGAAATTTCCCTCTCACCTACTTCTCTGGCCCTGGGTACCCCTCTTCT 
CATCACTTCCTGTTCCCACCACTGGACTGGGCTGGCCCAGCCCCTGTTTTTCCAACATTCCC 
CAGTATCCCCAGCTTCTGCTGCGCTGGTTTGCGGCTTTGGGAAATAAAATACCGTTGTATAT 
ATTCTGCCAGGGGTGTTCTAGCTTTTTGAGGACAGCTCCTGTATCCTTCTCATCCTTGTCTC 
TCCGCTTGTCCTCTTGTGATGTTAGGACAGAGTGAGAGAAGTCAGCTGTCACGGGGAAGGTG 
AGAGAGAGGATGCTAAGCTTCCTACTCACTTTCTCCTAGCCAGCCTGGACTTTGGAGCGTGG 
GGTGGGTGGGACAATGGCTCCCCACTCTAAGCACTGCCTCCCCTACTCCCCG'CATCTTTGGG 
GAATCGGTTCCCCATATGTCTTCCTTACTAGACTGTGAGCTCCTCGAGGGGGGGCCCGGTAC 
CCAATTCGCCCTATAGTGAGTCGTA 
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FIGURE 125 

MDPARKAGAQAMIWTAGWLLLLLLRGGAQALECYSCVQKADDGCSPNKMKTVKCAPGVDVCT 

EAVGAVETIHGQFSLAVRGCGSGLPGKNDRGLDLHGLLAFIQLQQCAQDRCNAKLNLTSRAL 

DPAGNESAYPPNGVECYSCVGLSREACQGTSPPWSC^ASDHVYKGC^ 

SLPVRGCVQDEFCTRDGVTGPGFTLSGSCCQGSRCNSDLRNKTYFSPRIPPLVRLPPPEPTT 

VASTTSVTTSTSAPVRPTSTTKPMPAPTSQTPRQGVEHEASRDEEPRLTGGAAGHQDRSNSG 

QYPAKGGPQQPHNKGCVAPTAGLAALLLAVAAGVLL 
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FIGURE 126 

CGGGACTCGGCGGGTCCTCCTGGGAGTCTCGGAGGGGACCGGCTGTGCAGACGCCATGGAGT 
TGGTGCTGGTCTTCCTCTGCAGCCTGCTGGCCCCCATGGTCCTGGCCAGTGCAGCTGAAAAG 
GAGAAGGAAATGGACCCTTTTCATTATGATTACCAGACCCTGAGGATTGGGGGACTGGTGTT 
CGCTGTGGTCCTCTTCTCGGTTGGGATCCTCCTTATCCTAAGTCGCAGGTGCAAGTGCAGTT 
TCAATCAGAAGCCCCGGGCCCCAGGAGATGAGGAAGCCCAGGTGGAGAACCTCATCACCGCC 
AATGCAACAGAGCCCCAGAAGCAGAGAACTGAAGTGCAGCCATCAGGTGGAAGCCTCTGGAA 
CCTGAGGCGGCTGCTTGAACCTTTGGATGCAAATGTCGATGCTTAAGAAAACCGGCCACTTC 
AGCAACAGCCCTTTCCCCAGGAGAAGCCAAGAACTTGTGTGTCCCCCACCCTATCCCCTCTA 
ACACCATTCCTCCACCTGATGATGCAACTAACACTTGCCTCCCCACTGCAGCCTGCGGTCCT 
GCCCACCTCCCGTGATGTGTGTGTGTGTGTGTGTGTGTGACTGTGTGTGTTTGCTAACTGTG 
GTCTTTGTGGCTACTTGTTTGTGGATGGTATTGTGTTTGTTAGTGAACTGTGGACTCGCTTT 
CCCAGGCAGGGGCTGAGCCACATGGCCATCTGCTCCTCCCTGCCCCCGTGGCCCTCCATCAC 
CTTCTGCTCCTAGGAGGCTGCTTGTTGCCCGAGACCAGCCCCCTCCCCTGATTTAGGGATGC 
GTAGGGTAAGAGCACGGGCAGTGGTCTTCAGTCGTCTTGGGACCTGGGAAGGTTTGCAGCAC 
TTTGTCATCATTCTTCATGGACTCCTTTCACTCCTTTAACAAAAACCTTGCTTCCTTATCCC 
ACCTGATCCCAGTCTGAAGGTCTCTTAGCAACTGGAGATACAAAGCAAGGAGCTGGTGAGCC 
CAGCGTTGACGTCAGGCAGGCTATGCCCTTCCGTGGTTAATTTCTTCCCAGGGGCTTCCACG 
AGGAGTCCCCATCTGCCCCGCCCCTTCACAGAGCGCCCGGGGATTCCAGGCCCAGGGCTTCT 
ACTCTGCCCCTGGGGAATGTGTCCCCTGCATATCTTCTCAGCAATAACTCCATGGGCTCTGG 
GACCCTACCCCTTCCAACCTTCCCTGCTTCTGAGACTTCAATCTACAGCCCAGCTCATCCAG 
ATGCAGACTACAGTCCCTGCAATTGGGTCTCTGGCAGGCAATAGTTGAAGGACTCCTGTTCC 
GTTGGGGCCAGCACACCGGGATGGATGGAGGGAGAGCAGAGGCCTTTGCTTCTCTGCCTACG 
TCCCCTTAGATGGGCAGCAGAGGCAACTCCCGCATCCTTTGCTCTGCCTGTCGGTGGTCAGA 
GCGGTGAGCGAGGTGGGTTGGAGACT CAGC AGGCTC CGTG C AG C CCTTGGGAAC AGTGAGAG 
GTTGAAGGTCATAACGAGAGTGGGAACTCAACCCAGATCCCGCCCCTCCTGTCCTCTGTGTT 
CCCGCGGAAACCAACCAAACCGTGCGCTGTGACCCATTGCTGTTCTCTGTATCGTGATCTAT 
CCTCAACAACAACAGAAAAAAGGAATAAAATATCCTTTGTTTCCT 
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FIGURE 127 

MELVLVFLCSLLAPMVLASAAEKEKEMDPFHYDYQTLRIGGLVFAVVLFSVGILLILSRRCK 
CSFNQKPRAPGDEEAQVENLITANATEPQKQRTEVQPSGGSLWNLRRLLEPLDANVDA 
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FIGURE 128 

AAACTTGACGCCATGAAGATCCCGGTCCTTCCTGCCGTGGTGCTCCTCTCCCTCCTGGTGCT 
CCACTCTGCCCAGGGAGCCACCCTGGGTGGTCCTGAGGAAGAAAGCACCATTGAGAATTATG 
CGTCACGACCCGAGGCCTTTAACACCCCGTTCCTGAACATCGACAAATTGCGATCTGCGTTT 
AAGGCTGATGAGTTCCTGAACTGGCACGCCCTCTTTGAGTCTATCAAAAGGAAACTTCCTTT 
CCTCAACTGGGATGCCTTTCCTAAGCTGAAAGGACTGAGGAGCGCAACTCCTGATGCCCAGT 
GACCATGACCTCCACTGGAAGAGGGGGCTAGCGTGAGCGCTGATTCTCAACCTACCATAACT 
CTTTCCTGCCTCAGGAACTCCAATAAAACATTTTCCATCCAAA 
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FIGURE 129 

MKIPVLPAWLLSLLVLHSAQGATLGGPEEESTIENYASRPEAFNTPFLNIDKLRSAFKADE 
FLNWHALFES I KRKLPFLNWDAFPKLKGLRSATPDAQ 
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FIGURE 130 

CAGTTCTGAAATCAATGGAGTTAATTTAGGGAATACAAACCAGCCATGGGGGTGGAGATTGC 
CTTTGCCTCAGTGATTCTCACCTGCCTCTCCCTTCTGGCAGCAGGAGTCTCCCAGGTTGTTC 
TTCTCCAGCCAGTTCCAACTCAGGAGACAGGTCCCAAGGCCATGGGAGATCTCTCCTGTGGC 
TTTGCCGGCCACTCATGAGAGTGTTTTTGTGTAAAGTATTTTTTAGAATACTGTTGACTTCT 
TCATGATTTAATAACCATCCTTTGCGAAGTTTTATGAGGCTTTAGGGGAATGTCAACCCTCA 
AATTTTTGTTATACTAGATGGCTTCCATTTACCCACCACTATTTTAAGGTCCCTTTATTTTT 
AGGTTCAAGGTTCATTTGACTTGAGAAAGTGCCCTTCTGCAGCTTCATTGATTTTGTTTATC 
TTCACTATTAATTGTAACGATTAAAAAAGAAT AAGAGCACGCAGAC CT CTAGGAGAATATTT 
TATCCCTGGGTGCCCCTGACACATTTATGTAGTGATCCCACAAATGTGATTGTTAATTTAAA 
TGTTATTCTAATATTAGTACATTCAGTTGTGATGTAATATGAATAACCAGAATCTATTTCTT 
AAAAGTTTTGAGTATATTTTTCAACTAGATATTTGTATAGAAAGACTGAATAGTGATG 
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FIGURE 131 



MGVEIAFASVILTCLSLLAAGVSQWLLQPVPTQETGPKAMGDLSCGFAGHS 
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FIGURE 132 



GGGGAATCTGCAGTAGGTCTGCCGGCGATGGAGTGGTGGGCTAGCTCGCCGCTTCGGCTCTG 
GCTGCTGTTGTTCCTCCTGCCCTCAGCGCAGGGCCGCCAGAAGGAGTCAGGTTCAAAATGGA 
AAGTATTTATTGACCAAATTAACAGGTCTTTGGAGAATTACGAACCATGTTCAAGTCAAAAC 
TGCAGCTGCTACCATGGTGTCATAGAAGAGGATCTAACTCCTTTCCGAGGAGGCATCTCCAG 
GAAGATGATGGCAGAGGTAGTCAGACGGAAGCTAGGGACCCACTATCAGATCACTAAGAACA 
GACTGTACCGGGAAAATGACTGCATGTTCCCCTCAAGGTGTAGTGGTGTTGAGCACTTTATT 
TTGGAAGTGATCGGGCGTCTCCCTGACATGGAGATGGTGATCAATGTACGAGATTATCCTCA 
GGTTCCTAAATGGATGGAGCCTGCCATCCCAGTCTTCTCCTTCAGTAAGACATCAGAGTACC 
ATGATATCATGTATCCTGCTTGGACATTTTGGGAAGGGGGACCTGCTGTTTGGCCAATTTAT 
CCTACAGGTCTTGGACGGTGGGACCTCTTCAGAGAAGATCTGGTAAGGTCAGCAGCACAGTG 
GCCATGGAAAAAGAAAAACTCTACAGCATATTTCCGAGGATCAAGGACAAGTCCAGAACGAG 
ATCCTCTCATTCTTCTGTCTCGGAAAAACCCAAAACTTGTTGATGCAGAATACACCAAAAAC 
CAGGCCTGGAAATCTATGAAAGATACCTTAGGAAAGCCAGCTGCTAAGGATGTCCATCTTGT 
GGATCACTGCAAATACAAGTATCTGTTTAATTTTCGAGGCGTAGCTGCAAGTTTCCGGTTTA 
AACACCTCTTCCTGTGTGGCTCACTTGTTTTCCATGTTGGTGATGAGTGGCTAGAATTCTTC 
TATCCACAGCTGAAGCCATGGGTTCACTATATCCCAGTCAAAACAGATCTCTCCAATGTCCA 
AGAGCTGTTACAATTTGTAAAAGCAAATGATGATGTAGCTCAAGAGATTGCTGAAAGGGGAA 
GCCAGTTTATTAGGAACCATTTGCAGATGGATGACATCACCTGTTACTGGGAGAACCTCTTG 
AGTGAATACTCTAAATTCCTGTCTTATAATGTAACGAGAAGGAAAGGTTATGATCAAATTAT 
TCCCAAAATGTTGAAAACTGAACTATAGTAGTCATCATAGGACCATAGTCCTCTTTGTGGCA 
ACAGATCTCAGATATCCTACGGTGAGAAGCTTACCATAAGCTTGGCTCCTATACCTTGAATA 
TCTGCTATCAAGCCAAATACCTGGTTTTCCTTATCATGCTGCACCCAGAGCAACTCTTGAGA 
AAGATTTAAAATGTGTCTAATACACTGATATGAAGCAGTTCAACTTTTTGGATGAATAAGGA 
CCAGAAATCGTGAGATGTGGATTTTGAACCCAACTCTACCTTTCATTTTCTTAAGACCAATC 
ACAGCTTGTGCCTCAGATCATCCACCTGTGTGAGTCCATCACTGTGAAATTGACTGTGTCCA 
TGTGATGATGCCCTTTGTCCCATTATTTGGAGCAGAAAATTCGTCATTTGGAAGTAGTACAA 
CTCATTGCTGGAATTGTGAAATTATTCAAGGCGTGATCTCTGTCACTTTATTTTAATGTAGG 
AAACCCTATGGGGTTTATGAAAAATACTTGGGGATCATTCTCTGAATGGTCTAAGGAAGCGG 
TAGCCATGCCATGCAATGATGTAGGAGTTCTCTTTTGTAAAACCATAAACTCTGTTACTCAG 
GAGGTTTCTATAATGCCACATAGAAAGAGGCCAATTGCATGAGTAATTATTGCAATTGGATT 
TCAGGTTCCCTTTTTGTGCCTTCATGCCCTACTTCTTAATGCCTCTCTAAAGCCAAA 
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FIGURE 133 

MEWWASSPLRLWLLLFLLPSAQGRQKESGSKWKVFIDQINRSLENYEPCSSQNCSCYHGVIE 
EDLTPFRGGISRKMMAEVVRRKLGTHYQITKNRLYRENDCMFPSRCSGVEHFILEVIGRLPD 
MEMVINVRDYPQVPKWMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDL 
FREDLVRSAAQWPWKKKNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKSMKDT 
LGKPAAKDVHLVDHCKYKYLFNFRGVAASFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVH 
YIPVKTDLSNVQELLQFVKANDDVAQEIAERGSQFIRNHLQMDDITCYWENLLSEYSKFLSY 
NVTRRKGYDQ 1 1 PKMLKTEL 
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FIGURE 134 

CACCCCTCCATTTCTCGCCATGGCCCCTGCACTGCTCCTGATCCCTGCTGCCCTCGCCTCTT 
TCATCCTGGCCTTTGGCACCGGAGTGGAGTTCGTGCGCTTTACCTCCCTTCGGCCACTTCTT 
GGAGGGATCCCGGAGTCTGGTGGTCCGGATGCCCGCCAGGGATGGCTGGCTGCCCTGCAGGA 
CCGCAGCATCCTTGCCCCCCTGGCATGGGATCTGGGGCTCCTGCTTCTATTTGTTGGGCAGC 
ACAGCCTCATGGCAGCTGAAAGAGTGAAGGCATGGACATCCCGGTACTTTGGGGTCCTTCAG 
AGGTCACTGTATGTGGCCTGCACTGCCCTGGCCTTGCAGCTGGTGATGCGGTACTGGGAGCC 
CATACCCAAAGGCCCTGTGTTGTGGGAGGCTCGGGCTGAGCCATGGGCCACCTGGGTGCCGC 
TCCTCTGCTTTGTGCTCCATGTCATCTCCTGGCTCCTCATCTTTAGCATCCTTCTCGTCTTT 
GACTATGCTGAGCTCATGGGCCTCAAACAGGTATACTACCATGTGCTGGGGCTGGGCGAGCC 
TCTGGCCCTGAAGTCTCCCCGGGCTCTCAGACTCTTCTCCCACCTGCGCCACCCAGTGTGTG 
TGGAGCTGCTGACAGTGCTGTGGGTGGTGCCTACCCTGGGCACGGACCGTCTCCTCCTTGCT 
TTCCTCCTTACCCTCTACCTGGGCCTGGCTCACGGGCTTGATCAGCAAGACCTCCGCTACCT 
CCGGGCCCAGCTACAAAGAAAACTCCACCTGCTCTCTCGGCCCCAGGATGGGGAGGCAGAGT 
GAGGAGCTCACTCTGGTTACAAGCCCTGTTCTTCCTCTCCCACTGAATTCTAAATCCTTAAC 
ATCCAGGCCCTGGCTGCTTCATGCCAGAGGCCCAAATCCATGGACTGAAGGAGATGCCCCTT 
CTACTACTTGAGACTTTATTCTCTGGGTCCAGCTCCATACCCTAAATTCTGAGTTTCAGCCA 
CTGAACTCCAAGGTCCACTTCTCACCAGCAAGGAAGAGTGGGGTATGGAAGTCATCTGTCCC 
TTCACTGTTTAGAGCATGACACTCTCCCCCTCAACAGCCTCCTGAGAAGGAAAGGATCTGCC 
CTGACCACTCCCCTGGCACTGTTACTTGCCTCTGCGCCTCAGGGGTCCCCTTCTGCACCGCT 
GGCTTCCACTCCAAGAAGGTGGACCAGGGTCTGCAAGTTCAACGGTCATAGCTGTCCCTCCA 
GGCCCCAACCTTGCCTCACCACTCCCGGCCCTAGTCTCTGCACCTCCTTAGGCCCTGCCTCT 
GGGCTCAGACCCCAACCTAGTCAAGGGGATTCTCCTGCTCTTAACTCGATGACTTGGGGCTC 
CCTGCTCTCCCGAGGAAGATGCTCTGCAGGAAAATAAAAGTCAGCCTTTTTCTAAAAAAAA 
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FIGURE 135 

MAPALLLIPAALASFILAFGTGVEFWFTSLRPLLGGIPESGGPDARQGWLAALQDRSILAP 

LAWDLGLLLLFVGQHSLMAAERVKAWTSRYFGVLQRSLW 

LWEARAEPWATWPLLCFVLHVISWLLIFSILL^ 

RALRLFSHLRHPVCVELLTVLWVVPTLGTDRLLLAFLLTLYLGLAHGLDQQDLRYLRAQLQR 
KLHLLSRPQDGEAE 
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CCGAGCACAGGAGATTGCCTGCGTTTAGGAGGTGGCTGCGTTGTGGGAAAAGCTATCAAGGA 
AGAAATTGCCAAACCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACAACAGATCTGAGTGT 
TTTAATTAAGCATGGAATACAGAAAACAACAAAAAACTTAAGCTTTAATTTCATCTGGAATT 
CCACAGTTTTCTTAGCTCCCTGGACCCGGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTA 
TCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTAAAGAACCTTCGGCTCGCGTGCTTCTG 
AGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTAGGATGTCACTGAGATCC 
CTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTACCTCAGCCT 
TCCCCACTACAATGTGATAGAACGCGTGAACTGGATGTACTTCTATGAGTATGAGCCGATTT 
ACAGACAAGACTTTCACTTCACACTTCGAGAGCATTCAAACTGCTCTCATCAAAATCCATTT 
CTGGTCATTCTGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTAC 
TTGGGGTGAAAAAAAGTCTTGGTGGGGATATGAGGTTCTTACATTTTTCTTATTAGGCCAAG 
AGGCTGAAAAGGAAGACAAAATGTTGGCATTGTCCTTAGAGGATGAACACCTTCTTTATGGT 
GACATAATCCGACAAGATTTTTTAGACACATATAATAACCTGACCTTGAAAACCATTATGGC 
ATTCAGGTGGGTAACTGAGTTTTGCCCCAATGCCAAGTACGTAATGAAGACAGACACTGATG 
TTTTCATCAATACTGGCAATTTAGTGAAGTATCTTTTAAACCTAAACCACTCAGAGAAGTTT 
TTCACAGGTTATCCTCTAATTGATAATTATTCCTATAGAGGATTTTACCAAAAAACCCATAT 
TTCTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAA 
TGTCCAGAGATTTGGTGCCAAGGATCTATGAAATGATGGGTCACGTAAAACCCATCAAGTTT 
GAAGATGTTTATGTCGGGATCTGTTTGAATTTATTAAAAGTGAACATTCATATTCCAGAAGA 
CACAAATCTTTTCTTTCTATATAGAATCCATTTGGATGTCTGTCAACTGAGACGTGTGATTG 
CAGCCCATGGCTTTTCTTCCAAGGAGATCATCACTTTTTGGCAGGTCATGCTAAGGAACACC 
ACATGCCATTATTAACTTCACATTCTACAAAAAGCCTAGAAGGACAGGATACCTTGTGGAAA 
GTGTTAAATAAAGTAGGTACTGTGGAAAATTCATGGGGAGGTCAGTGTGCTGGCTTACACTG 
AACTGAAACTCATGAAAAACCCAGACTGGAGACTGGAGGGTTACACTTGTGATTTATTAGTC 
AGGCCCTTCAAAGATGATATGTGGAGGAATTAAATATAAAGGAATTGGAGGTTTTTGCTAAA 
GAAATTAATAGGACCAAACAATTTGGACATGTCATTCTGTAGACTAGAATTTCTTAAAAGGG 
TGTTACTGAGTTATAAGCTCACTAGGCTGTAAAAACAAAACAATGTAGAGTTTTATTTATTG 
AACAATGTAGTCACTTGAAGGTTTTGTGTATATCTTATGTGGATTACCAATTTAAAAATATA 
TGTAGTTCTGTGTCAAAAAACTTCTTCACTGAAGTTATACTGAACAAAATTTTACCTGTTTT 
TGGTCATTTATAAAGTACTTCAAGATGTTGCAGTATTTCACAGTTATTATTATTTAAAATTA 
CTTCAACTTTGTGTTTTTAAATGTTTTGACGATTTCAATACAAGATAAAAAGGATAGTGAAT 
CATTCTTTACATGCAAACATTTTCCAGTTACTTAACTGATCAGTTTATTATTGATACATCAC 
TCCATTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAAT 
ATTTTACTGTGGTAATATAGAGAAGAATTAAAGCAAGAAAATCTGAAAA 
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FIGURE 137 

MASALWTVLPSRMSLRSLKWSLLLLSLLSFFVMWYLSLPHY3STVI ERVNWMYF YE YE P I YRQD 
FHFTLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEK 
EDKMLALSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWVTEFCPNAKYVMKTDTDVFIN 
TGNLVKYLLNLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRD 
L VPR I YEMMGHVKP I KFED VYVG I CLNLLKVN I H I P EDTNLF F L YR I HLD VCQLRRV I AAHG 
FSSKE I I TFWQVMLRNTTCHY 
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FIGURE 138 

CCTCTGTCCACTGCTTTCGTGAAGACAAGATGAAGTTCACAATTGTCTTTGCTGGACTTCTT 
GGAGTCTTTCTAGCTCCTGCCCTAGCTAACTATAATATCAACGTCAATGATGACAACAACAA 
TGCTGGAAGTGGGCAGCAGTCAGTGAGTGTCAACAATGAACACAATGTGGCCAATGTTGACA 
ATAACAACGGATGGGACTCCTGGAATTCCATCTGGGATTATGGAAATGGCTTTGCTGCAACC 
AGACTCTTTCAAAAGAAGACATGCATTGTGCACAAAATGAACAAGGAAGTCATGCCCTCCAT 
TCAATCCCTTGATGCACTGGTCAAGGAAAAGAAGCTTCAGGGTAAGGGACCAGGAGGACCAC 
CTCCCAAGGGCCTGATGTACTCAGTCAACCCAAACAAAGTCGATGACCTGAGCAAGTTCGGA 
AAAAACATTGCAAACATGTGTCGTGGGATTCCAACATACATGGCTGAGGAGATGCAAGAGGC 
AAGCCTGTTTTTTTACTCAGGAACGTGCTACACGACCAGTGTACTATGGATTGTGGACATTT 
CCTTCTGTGGAGACACGGTGGAGAACTAAACAATTTTTTAAAGCCACTATGGATTTAGTCAT 
CTGAATATGCTGTGCAGAAAAAATATGGGCTCCAGTGGTTTTTACCATGTCATTCTGAAATT 
TTTCTCTACTAGTTATGTTTGATTTCTTTAAGTTTCAATAAAATCATTTAGCATTGAAAAAAA 
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FIGURE 139 

MKFTIVFAGLLGVFLAPALANYWINVNDDNNNAGSGQQSVSVN^ 

IWDYGNGFAATRLFQKKTCIVHKMNKEVMPSIQSLDALVKEKKLQGKGPGGPPPKGLMYSVN 
PNKVDDLSKFGKNIANMCRGIPTYMAEEMQEASLFFYSGTCYTTSVLWIVDISFCGDTVEN 
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FIGURE 140 

CATTTCTGAAACTAATCGTGTCAGAATTGACTTTGAAAAGCATTGCTTTTTACAGAAGTATA 
TTAACTTTTTAGGAGTAATTTCTAGTTTGGATTGTAATATGAAATAATTTAAAAGGGCTTCG 
CTCATATATAGGAAAATCGCATATGGTCCTAGTATTAAATTCTTATTGCTTACTGATTTTTT 
TGAGTTAAGAGTTGTTATATGCTAGAATATGAGGATGTGAATATAAATAAGAGAAGAAAT^AA 
GAATAAAGTAGATTGAGTCTCCAATTTTATGTAAGCTTCAGAAGAACTGGTTTGTTTACATG 
CAAGCTTATAGTTGAAATATTTTTCAGGAATTACATGAATGACAGTCTTCGAACCAATGTGT 
TTGTTCGATTTCAACCAGAGACTATAGCATGTGCTTGCATCTACCTTGCAGCTAGAGCACTT 
CAGATTCCGTTGCCAACTCGTCCCCATTGGTTTCTTCTTTTTGGTACTACAGAAGAGGAAAT 
CCAGGAAATCTGCATAGAAACACTTAGGCTTTATACCAGAAAAAAGCCAAACTATGAATTAC 
TGGAAAAAGAAGTAGAAAAAAGAAAAGTAGCCTTACAAGAAGCCAAATTAAAAGCAAAGGGA 
TTGAATCCGGATGGAACTCCAGCCCTTTCAACCCTGGGTGGATTTTCTCCAGCCTCCAAGCC 
ATCATCACCAAGAGAAGTAAAAGCTGAAGAGAAATCACCAATCTCCATTAATGTGAAGACAG 
TCAAAAAAGAACCTGAGGATAGACAACAGGCTTCCAAAAGCCCTTACAATGGTGTAAGAAAA 
GACAGCAAGAGAAGTAGAAATAGCAGAAGTGCAAGTCGATCGAGGTCAAGAACACGATCACG 
TTCTAGATCACATACTCCAAGAAGACACTATAATAATAGGCGGAGTCGATCTGGAACATACA 
GCTCGAGATCAAGAAGCAGGTCCCGCAGTCACAGTGAAAGCCCTCGAAGACATCATAATCAT 
GGTTCTCCTCACCTTAAGGCCAAGCATACCAGAGATGATTTAAT^AAGTTCAAACAGACATGG 
TCATAAAAGGAAAAAATCTCGTTCTCGATCTCAGAGCAAGTCTCGGGATCACTCAGATGCAG 
CCAAGAAACACAGGCATGAAAGGGGACATCATAGGGACAGGCGTGAACGATCTCGCTCCTTT 
GAGAGGTCCCATAAAAGCAAGCACCATGGTGGCAGTCGCTCAGGACATGGCAGGCACAGGCG 
CTGACTTTCTCTTCCTTTGAGCCTGCATCAGTTCTTGGTTTTGCCTATCTACAGTGTGATGT 
ATGGACTCAATCAAAAACATTAAACGCAAACTGATTAGGATTTGATTTCTTGAAACCCTCTA 
GGTCTCTAGAACACTGAGGACAGTTTCTTTTGAAAAGAACTATGTTAATTTTTTTGCACATT 
AAAATGCCCTAGCAGTATCTAATTAAAAACCATGGTCAGGTTCAATTGTACTTTATTATAGT 
TGTGTATTGTTTATTGCTATAAGAACTGGAGCGTGAATTCTGTAAAAATGTATCTTATTTTT 
ATACAGATAAAATTGCAGACACTGTTCTATTTAAGTGGTTATTTGTTTAAATGATGGTGAAT 
ACTTTCTTAACACTGGTTTGTCTGCATGTGTAAAGATTTTTACAAGGAAATAAAATACAAAT 
CTTGTTTTTTCTAAAAAAAAAAAAAAAAAAGT 
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FIGURE 141 

MNDSLRTNVFVRFQPETIACAC 

TRKKPNYELLEKEVEKRKVALQEAKLKAKGLNPDGTPALSTLGGFSPASKPSSPREVKAEEK 
SPISINVKTVKKEPEDRQQASKSPYNGVRKDSKRSRNSRSASRSRSRTRSRSRSHTPRRHYN 
NRRSRSGTYSSRSRSRSRSHSESPRRHHNHGSPHLKAKHTRDDLKSSNRHGHKRKKSRSRSQ 
SKSRDHSDAAKKHRHERGHHRDRRERSRSFERSHKSKHHGGSRSGHGRHRR 
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FIGURE 142 

TGGGGATAAAGGAAAAATGGTCAGGTATTAATGGCTTAAAGATTATTGGAAGGGGTTTATCA 
TTTTTTGAANNTATTCGGGTCANAATTGNCTTTGAAAAGCATTGCTTTTTACAGAAATATAT 
TANCTTTTTAGAGTAATTTCTAGTTTGGATTGTAATATGAAATTATTTAAAAGGGCTTCGCT 
CATATATAGGAAAATCGCATATGGTCCTAGTATTAAATTNTTATTGCTTACTGATTTTTTTG 
AGTTAAGAGTTGTTATATGNTAGAATATGAGGATGTGAATATAAATAAGAGAAGAAAAAAGA 
ATAAAGTAGATTGAGTCTCCAATTTTATGTAAGCTTCAGAAGAACTGGTTTGTTTACATGCA 
AGCTTATAGTTGAAATATTTTTCAGGAATTACATGAATGACAGTCTTCGAACCAATGTGTTT 
GTTCGATTTCAACCAGAGANTATAGCATGTGCTTGCATCTACCTTGCAGNTAGAGCACTTCA 
GATTCCGTTGCCAACTNGTCCCCATTGGTTTCTTCTTTTTGGTACTACAGAAGAGGAAATCC 
AGGAAATNTGCATAGAAACACTTAGGCTTTATACCAGAAAAAAGCCAAACTATGAATTACTG 
GAAAAAGAAGT AGAAAAAAG AAAAGTAGCC TTAC AAGAAG CCNAATT AAAAG C AAAGGGATT 
GAATCCGGATGGAACTCCAGCCCTTTCAACCCTGGGTGGATTTTCTCC 
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FIGURE 143 

GGCACGAGGCCTCGTGCCAAGCTTGGCACGAGGGTGCACCGCGTTCTCGCACGCGTCATGGC 
GGTCCTCGGAGTACAGCTGGTGGTGACCCTGCTCACTGCCACCCTCATGCACAGGCTGGCGC 
CACACTGCTCCTTCGCGCGCTGGCTGCTCTGTAACGGCAGTTTGTTCCGATACAAGCACCCG 
TCTGAGGAGGAGCTTCGGGCCCTGGCGGGGAAGCCGAGGCCCAGAGGCAGGAAAGAGCGGTG 
GGCCAATGGCCTTAGTGAGGAGAAGCCACTGTCTGTGCCCCGAGATGCGCCGTTCCAGCTGG 
AGACCTGCCCCCTCACGACCGTGGATGCCCTGGTCCTGCGCTTCTTCCTGGAGTACCAGTGG 
TTTGTGGACTTTGCTGTGTACTCGGGCGGCGTGTACCTCTTCACAGAGGCCTACTACTACAT 
GCTGGGACCAGCCAAGGAGACTAACATTGCTGTGTTCTGGTGCCTGCTCACGGTGACCTTCT 
CCATCAAGATGTTCCTGACAGTGACACGGCTGTACTTCAGCGCCGAGGAGGGGGGTGAGCGC 
TCTGTCTGCCTCACCTTTGCCTTCCTCTTCCTGCTGCTGGCCATGCTGGTGCAAGTGGTGCG 
GGAGGAGACCCTCGAGCTGGGCCTGGAGCCTGGTCTGGCCAGCATGACCCAGAACTTAGAGC 
CACTTCTGAAGAAGCAGGGCTGGGACTGGGCGCTTCCTGTGGCCAAGCTGGCTATCCGCGTG 
GGACTGGCAGTGGTGGGCTCTGTGCTGGGTGCCTTCCTCACCTTCCCAGGCCTGCGGCTGGC 
CCAGACCCACCGGGACGCACTGACCATGTCGGAGGACAGACCCATGCTGCAGTTCCTCCTGC 
ACACCAGCTTCCTGTCTCCCCTGTTCATCCTGTGGCTCTGGACAAAGCCCATTGCACGGGAC 
TTCCTGCACCAGCCGCCGTTTGGGGAGACGCGTTTCTCCCTGCTGTCCGATTCTGCCTTCGA 
CTCTGGGCGCCTCTGGTTGCTGGTGGTGCTGTGCCTGCTGCGGCTGGCGGTGACCCGGCCCC 
ACCTGCAGGCCTACCTGTGCCTGGCCAAGGCCCGGGTGGAGCAGCTGCGAAGGGAGGCTGGC 
CGCATCGAAGCCCGTGAAATCCAGCAGAGGGTGGTCCGAGTCTACTGCTATGTGACCGTGGT 
GAGCTTGCAGTACCTGACGCCGCTCATCCTCACCCTCAACTGCACACTTCTGCTCAAGACGC 
TGGGAGGCTATTCCTGGGGCCTGGGCCCAGCTCCTCTACTATCCCCCGACCCATCCTCAGCC 
AGCGCTGCCCCCATCGGCTCTGGGGAGGACGAAGTCCAGCAGACTGCAGCGCGGATTGCCGG 
GGCCCTGGGTGGCCTGCTTACTCCCCTCTTCCTCCGTGGCGTCCTGGCCTACCTCATCTGGT 
GGACGGCTGCCTGCCAGCTGCTCGCCAGCCTTTTCGGCCTCTACTTCCACCAGCACTTGGCA 
GGCTCCTAGCTGCCTGCAGACCCTCCTGGGGCCCTGAGGTCTGTTCCTGGGGCAGCGGGACA 
CTAGCCTGCCCCCTCTGTTTGCGCCCCCGTGTCCCCAGCTGCAAGGTGGGGCCGGACTCCCC 
GGCGTTCCCTTCACCACAGTGCCTGACCCGCGGCCCCCCTTGGACGCCGAGTTTCTGCCTCA 
GAACTGTCTCTCCTGGGCCCAGCAGCATGAGGGTCCCGAGGCCATTGTCTCCGAAGCGTATG 
TGCCAGGTTTGAGTGGCGAGGGTGATGCTGGCTGCTCTTCTGAACAAATAAAGGAGCATGCC 
GATTTTTAA 
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FIGURE 144 

MAVLGVQLVVTLLTATLMHRIAPHCSFARWLLCNGSLFRYKHPSEEELRALAGKPRPRGRKE 
RWANGLSEEKPLSVPRDAPFQLETCPLTTVDALVLRFFLEYQWFVDFAVYSGGVYLFTEAYY 
YMLGPAKETNIAVFWCLLTVTFSIKMFLTVTRLYFSAEEGGERSVCLTFAFLFLLLAMLVQV 
VREETLELGLEPGLASMTQNLEPLLKKQGWDWALPVAKLAIRVGLAWGSVLGAFLTFPGLR 
LAQTHRDALTMSEDRPMLQFLLHTSFLSPLFILWLWTKPIARDFLHQPPFGETRFSLLSDSA 
FDSGRLWLLVVLCLLRLAVTRPHLQAYLCIAKARVEQLRREAGRIEAREIQQRVVRVYCYVT 
WSLQYLTPLILTLNCTLLLKTLGGYSWGLGPAPLLSPDPSSASAAPIGSGEDEVQQTAARI 
AGALGGLLTPLFLRGVLAYLIWWTAACQLLASLFGLYFHQHLAGS 



4 
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FIGURE 145 

CGTTNGCACGCGTCAATGGCGGTCCTCGGAGTACAGCTGGTGGTGACCCTGCTCACTGCCAC 
CCTCATGCACAGGCTGGCGCCACACTGCTCCTTCGCGCGCTGGCTGCTCTGTAACGGCAGTT 
TGTTCCGATACAAGCACCCGTNTTGAGGAGGAGCTTCGGGCCCTGGCGGGGAAGCCGAGGCC 
CAGAGGCAGGAAAGAGCGGTGGGCCAATGGCCTTAGTGAGGAGAAGCCACTGTCTGTGCCCC 
GAGATGCCCCGTTCCAGCTGGAGACCTGCCCCCTCACGACCGTGGATGCCCTGGTCCTGCGC 
TTCTTCCTGGAGTACCAGTGGTTTGTGGACTTTGCTGTGTACTCGGGCGGCGTGTACCTCTT 
CACAGAGGCCTACTACTACATGCTGGGACCAGCCAAGGAGACTAACATTGCTGTGTTCTGGT 
GCCTGCTCACAGTGACCTTCTCCATCAAGATGTTCCTGACAGTGACACGGCTGTACTTCAGC 
GCCGAGGAGGGGGGTGAGCGCTCTGTCTGCCTCACCTTTGCCTTCCTCTTCCTGCTGCTGGC 
CATGCTGGTGCAAGCG 
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FIGURE 146 

GGTTCCTACATCCTCTCATCTGAGAATCAGAGAGCATAATCTTCTTACGGGCCCGTGATTTA 
TTAACGTGGCTTAATCTGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGC 
ATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGGTTTGGGCCCTTGTAGCTGACAGAAGGTGGC 
CAGGGAGAATGCAGCACACTGCTCGGAGAATGAAGGCGCTTCTGTTGCTGGTCTTGCCTTGG 
CTCAGTCCTGCTAACTACATTGACAATGTGGGCAACCTGCACTTCCTGTATTCAGAACTCTG 
TAAAGGTGCCTCCCACTACGGCCTGACCAAAGATAGGAAGAGGCGCTCACAAGATGGCTGTC 
CAGACGGCTGTGCGAGCCTCACAGCCACGGCTCCCTCCCCAGAGGTTTCTGCAGCTGCCACC 
ATCTCCTTAATGACAGACGAGCCTGGCCTAGACAACCCTGCCTACGTGTCCTCGGCAGAGGA 
CGGGCAGCCAGCAATCAGCCCAGTGGACTCTGGCCGGAGCAACCGAACTAGGGCACGGCCCT 
TTGAGAGATCCACTATTAGAAGCAGATCATTTAAAAAAATAAATCGAGCTTTGAGTGTTCTT 
CGAAGGACAAAGAGCGGGAGTGCAGTTGCCAACCATGCCGACCAGGGCAGGGAAAATTCTGA 
AAACACCACTGCCCCTGAAGTCTTTCCAAGGTTGTACCACCTGATTCCAGATGGTGAAATTA 
CCAGCATCAAGATCAATCGAGTAGATCCCAGTGAAAGCCTCTCTATTAGGCTGGTGGGAGGT 
AGCGAAACCCCACTGGTCCATATCATTATCCAACACATTTATCGTGATGGGGTGATCGCCAG 
AGACGGCCGGCTACTGCCAGGAGACATCATTCTAAAGGTCAACGGGATGGACATCAGCAATG 
TCCCTCACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTG 
ATGCGTGAACAGAAGTTCCGCAGCAGGAACAATGGACAGGCCCCGGATGCCTACAGACCCCG 
AGATGACAGCTTTCATGTGATTCTCAACAAAAGTAGCCCCGAGGAGCAGCTTGGAATAAAAC 
TGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCGGTGTGGCA 
TATCGACATGGTCAGCTTGAGGAGAATGACCGTGTGTTAGCCATCAATGGACATGATCTTCG 
ATATGGCAGCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCG 
TCGTGTCCCGCCAGGTTCGGCAGCGGAGCCCTGACATCTTTCAGGAAGCCGGCTGGAACAGC 
AATGGCAGCTGGTCCCCAGGGCCAGGGGAGAGGAGCAACACTCCCAAGCCCCTCCATCCTAC 
AATTACTTGTCATGAGAAGGTGGTAAATATCCAAAAAGACCCCGGTGAATCTCTCGGCATGA 
CCGTCGCAGGGGGAGCATCACATAGAGAATGGGATTTGCCTATCTATGTCATCAGTGTTGAG 
CCCGGAGGAGTCATAAGCAGAGATGGAAGAATAAAAACAGGTGACATTTTGTTGAATGTGGA 
TGGGGTCGAACTGACAGAGGTCAGCCGGAGTGAGGCAGTGGCATTATTGAAAAGAACATCAT 
CCTCGATAGTACTCAAAGCTTTGGAAGTCAAAGAGTATGAGCCCCAGGAAGACTGCAGCAGC 
CCAGCAGCCCTGGACTCCAACCACAACATGGCCCCACCCAGTGACTGGTCCCCATCCTGGGT 
CATGTGGCTGGAATTACCACGGTGCTTGTATAACTGTAAAGATATTGTATTACGAAGAAACA 
CAGCTGGAAGTCTGGGCTTCTGCATTGTAGGAGGTTATGAAGAATACAATGGAAACAAACCT 
TTTTTCATCAAATCCATTGTTGAAGGAACACCAGCATACAATGATGGAAGAATTAGATGTGG 
TGATATTCTTCTTGCTGTCAATGGTAGAAGTACATCAGGAATGATACATGCTTGCTTGGCAA 
GACTGCTGAAAGAACTTAAAGGAAGAATTACTCTAACTATTGTTTCTTGGCCTGGCACTTTT 
TTATAGAATCAATGATGGGTCAGAGGAAAACAGAAAAATCACAAATAGGCTAAGAAGTTGAA 
ACACTATATTTATCTTGTCAGTTTTTATATTTAAAGAAAGAATACATTGTAAAAATGTCAGG 
AAAAGTATGATCATCTAATGAAAGCCAGTTACACCTCAGAAAATATGATTCCAAAAAAATTA 
AAACTACTAGTTTTTTTTCAGTGTGGAGGATTTCTCATTACTCTACAACATTGTTTATATTT 
TTTCTATTCAATAAAAAGCCCTAAAACAACTAAAATGATTGATTTGTATACCCCACTGAATT 
CAAGCTGATTTAAATTTAAAATTTGGTATATGCTGAAGTCTGCCAAGGGTACATTATGGCCA 
TTTTTAATTTACAGCTAAAATATTTTTTAAAATGCATTGCTGAGAAACGTTGCTTTCATCAA 
ACAAGAATAAATATTTTTCAGAAGTTAAA 
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FIGURE 147 

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTAT 
APSPEVSAAATISLMTDEPGLDNPAYVSSAEDGQPAISPVDSGRSNRTRARPFERSTIRSRS 
FKKINRALSVLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDP 
SESLSIRLVGGSETPLVHIIIQHIYRDGVIARDGRLLPGDIILKVNGMDISNVPHNYAVRLL 
RQPCQVLWLTVMREQKFRSRNNGQAPDAYRPRDDSFHVILNKSSPEEQLGIKLVRKVDEPGV 
FIFNVLDGGVAYRHGQLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRS 
PDIFQEAGWNSNGSWSPGPGERSNTPKPLHPTITCHEKWNIQKDPGESLGMTVAGGASHRE 
WDLPIYVISVEPGGVISRDGRIKTGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEV 
KEYEPQEDCSSPAALDSNHNI^PPSDWSPSWVl^LELPRCLYNCKDIVLRRNTAGSLGFCIV 
GGYEEYNGNKPFFIKSIVEGTPAYNDGRIRCGDILLAVNGRSTSGMIHACLARLLKELKGRI 
TLTIVSWPGTFL 
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FIGURE 148 

CCAAAGTGATCATTTGA7VAAAGAGATATCCACATCTTCAAGCCCATATAAAGGATAGAAGCT 
GCACAGGGCAGCTTTACTTACTCCAGCACCTTCCTCTCCCAGGCAAATGGTGCTGACCATCT 
TTGGGATACAATCTCATGGATACGAGGTTTTTAACATCATCAGCCCAAGCAACAATGGTGGC 
AATGTTCAGGAGACAGTGACAATTGATAATGAAAAAAATACCGCCATCGTTAACATCCATGC 
AGGATCATGCTCTTCTACCACAATTTTTGACTATAAACATGGCTACATTGCATCCAGGGTGC 
TCTCCCGAAGAGCCTGCTTTATCCTGAAGATGGACCATCAGAACATCCCTCCTCTGAACAAT 
CTCCAATGGTACATCTATGAGAAACAGGCTCTGGACAACATGTTCTCCAACAAATACACCTG 
GGTCAAGTACAACCCTCTGGAGTCTCTGATCAAAGACGTGGATTGGTTCCTGCTTGGGTCAC 
CCATTGAGAAACTCTGCAAACATATCCCTTTGTATAAGGGGGAAGTGGTTGAAAACACACAT 
AATGTCGGTGCTGGAGGCTGTGCAAAGGCTGGGCTCCTGGGCATCTTGGGAATTTCAATCTG 
TGCAGACATTCATGTTTAGGATGATTAGCCCTCTTGTTTTATCTTTTCAAAGAAATACATCC 
TTGGTTTACACTCAAAAGTCAAATTAAATTCTTTCCCAATGCCCCAACTAATTTTGAGATTC 
AGTCAGAAAATATAAATGCTGTATTTATA 
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FIGURE 149 

MKILVAFLWLTIFGIQSHGYEVFNIISPSNNGGNVQETVTIDNEKNTAIVNIHAGSCSSTT 
IFDYKHGYIASRVLSRRACFILKMDHQNIPPLNNLQV^IYEKQALDNMFSNKYTWVKYNPLE 
SL I KD VDWFLLGS P I E KLCKH I PL YKGE WENTHNVGAGGCAKAGLLG I LG I S I CAD I HV 
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FIGURE 150 

GGCACGAGCCAGGAACTAGGAGGTTCTCACTGCCCGAGCAGAGGCCCTACACCCACCGAGGC 
AT6GGGCTCCCTGGGCTGTTCTGCTTGGCCGTGCTGGCTGCCAGCAGCTTCTCCAAGGCACG 
GGAGGAAGAAATTACCCCTGTGGTCTCCATTGCCTACAAAGTCCTGGAAGTTTTCCCCAAAG 
GCCGCTGGGTGCTCATAACCTGCTGTGCACCCCAGCCACCACCGCCCATCACCTATTCCCTC 
TGTGGAACCAAGAACATCAAGGTGGCCAAGAAGGTGGTGAAGACCCACGAGCCGGCCTCCTT 
CAACCTCAACGTCACACTCAAGTCCAGTCCAGACCTGCTCACCTACTTCTGCCGGGCGTCCT 
CCACCTCAGGTGCCCATGTGGACAGTGCCAGGCTACAGATGCACTGGGAGCTGTGGTCCAAG 
CCAGTGTCTGAGCTGCGGGCCAACTTCACTCTGCAGGACAGAGGGGCAGGCCCCAGGGTGGA 
GATGATCTGCCAGGCGTCCTCGGGCAGCCCACCTATCACCAACAGCCTGATCGGGAAGGATG 
GGCAGGTCCACCTGCAGCAGAGACCATGCCACAGGCAGCCTGCCAACTTCTCCTTCCTGCCG 
AGCCAGACATCGGACTGGTTCTGGTGCCAGGCTGCAAACAACGCCAATGTCCAGCACAGCGC 
CCTCACAGTGGTGCCCCCAGGTGGTGACCAGAAGATGGAGGACTGGCAGGGTCCCCTGGAGA 
GCCCCATCCTTGCCTTGCCGCTCTACAGGAGCACCCGCCGTCTGAGTGAAGAGGAGTTTGGG 
GGGTTCAGGATAGGGAATGGGGAGGTCAGAGGACGCAAAGCAGCAGCCATG TAGA ATGAACC 
GTCCAGAGAGCCAAGCACGGCAGAGGACTGCAGGCCATCAGCGTGCACTGTTCGTATTTGGA 
GTTCATGCAAAATGAGTGTGTTTTAGCTGCTCTTGCCACAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 151 

MGLPGLFCLAVLAASSFSKAREEEITPWSIAYKVLEVFPKGRWVLITCCAPQPPPPITYSL 
CGTKNIKVAKKVVKTHEPASFNLNVTLKSSPDLLTYFCRASSTSGAHVDSARLQMHWELWSK 
PVSELRANFTLQDRGAGPRVEMICQASSGSPPITNSLIGKDGQVHLQQRPCHRQPANFSFLP 
SQTSDWFWCQAANNANVQHSALTWPPGGDQKMEDWQGPLESPILALPLYRSTRRLSEEEFG 
GFR I GNGE VRGRKAAAM 
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FIGURE 152 

GGTCCTTAATGGCAGCAGCCGCCGCTACCAAGATCCTTCTGTGCCTCCCGCTTCTGCTCCTG 
CTGTCCGGCTGGTCCCGGGCTGGGCGAGCCGACCCTCACTCTCTTTGCTATGACATCACCGT 
CATCCCTAAGTTCAGACCTGGACCACGGTGGTGTGCGGTTCAAGGCCAGGTGGATGAAAAGA 
CTTTTCTTCACTATGACTGTGGCAACAAGACAGTCACACCTGTCAGTCCCCTGGGGAAGAAA 
CTAAATGTCACAACGGCCTGGAAAGCACAGAACCCAGTACTGAGAGAGGTGGTGGACATACT 
TACAGAGCAACTGCGTGACATTCAGCTGGAGAATTACACACCCAAGGAACCCCTCACCCTGC 
AGGCAAGGATGTCTTGTGAGCAGAAAGCTGAAGGACACAGCAGTGGATCTTGGCAGTTCAGT 
TTCGATGGGCAGATCTTCCTCCTCTTTGACTCAGAGAAGAGAATGTGGACAACGGTTCATCC 
TGGAGCCAGAAAGATGAAAGAAAAGTGGGAGAATGACAAGGTTGTGGCCATGTCCTTCCATT 
ACTTCTCAATGGGAGACTGTATAGGATGGCTTGAGGACTTCTTGATGGGCATGGACAGCACC 
CTGGAGCCAAGTGCAGGAGCACCACTCGCCATGTCCTCAGGCACAACCCAACTCAGGGCCAC 
AGCCACCACCCTCATCCTTTGCTGCCTCCTCATCATCCTCCCCTGCTTCATCCTCCCTGGCA 
TCTGAGGAGAGTCCTTTAGAGTGACAGGTTAAAGCTGATACCAAAAGGCTCCTGTGAGCACG 
GTCTTGATCAAACTCGCCCTTCTGTCTGGCCAGCTGCCCACGACCTACGGTGTATGTCCAGT 
GGCCTCCAGCAGATCATGATGACATCATGGACCCAATAGCTCATTCACTGCCTTGATTCCTT 
TTGCCAACAATTTTACCAGCAGTTATACCTAACATATTATGCAATTTTCTCTTGGTGCTACC 
TGATGGAATTCCTGCACTTAAAGTTCTGGCTGACTAAACAAGATATATCATTTTCTTTCTTC 
TCTTTTTGTTTGGAAAATCAAGTACTTCTTTGAATGATGATCTCTTTCTTGCAAATGATATT 
GTCAGTAAAATAATCACGTTAGACTTCAGACCTCTGGGGATTCTTTCCGTGTCCTGAAAGAG 
AATTTTTAAATTATTTAATAAGAAAAAATTTATATTAATGATTGTTTCCTTTAGTAATTTAT 
TGTTCTGTACTGATATTTAAATAAAGAGTTCTATTTCCCAAAAAAAAAAAAAAAAAA 
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FIGURE 153 

MAAAAATKILLCLPLLLLLSGWSRAGRADPHSLCYDITVIPKFRPGPRWCAVQGQVDEKTFL 
HYDCGNKTVTPVS PLGKKLNVTTAWKAQNP VLRE WD I LTEQLRD I QLENYTPKEPLTLQAR 
MSCEQKAEGHSSGSWQFSFDGQIFLLFDSEKRMWTW 

MGDC I GWLEDFLMGMDSTLE PS AGAPLAMS SGTTQLRATATTL ILCCLLIILPCFILPGI 
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FIGURE 154 

GGGAAAGCCATTTCGAAAACCCATCTATACAAACTATATATTTTCATTTCTGCTGCTAGCTG 
CCTTGGGCCTCACAATTTTCATTCTGTTTTCTGACTTTCAAGTTATATACCGTGGAATGGAG 
TTGATCCCAACCATAACATCGTGGAGGGTTTTAATTTTGGTGGTAGCCCTCACCCAATTCTG 
GTGTGGCTTTCTTTGCAGAGGATTCCACCTTCAAAATCATGAACTCTGGCTGTTGATCAAAA 
GAGAATTTGGATTCTACTCTAAAAGTCAATATAGGACTTGGCAAAAGAAGCTAGCAGAAGAC 
TCAACCTGGCCTCCCATAAACAGGACAGATTATTCAGGTGATGGCAAAAATGGATTCTACAT 
CAACGGAGGCTATGAAAGCCATGAACAGATTCCAAAAAGAAAACTCAAATTGGGAGGCCAAC 
CCACAGAACAGCATTTCTGGGCCAGGCTGT^ATCAGAATTGTCGTCGTACATGCTCAACAGC 
ATTGCTTTTTTCCCCAAAATTAACACATTGTGGAGAAGTGATGATACTCTCCCCTTACCTTT 
CCTCTCTCCATTCAAGCATTCAAAGTATATTTTCAATGAATTAAACCTTGCAGCAAGGGACC 
TTAGATAGGCTTATTCTGACTGTATGCTTTACCAATGAGAGAAAAAAATGCATTTCCTGTAT 
CATCCTTTTCAATAAACTGTATTCATTTTGAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 155 



MELIPTITSWRVLILWALTQFWCGFLCRGFHLQNHELWLLIKREFGFYSKSQYRTWQKKLA 
EDSTWPPINRTDYSGDGKNGFYINGGYESHEQIPKRKLKLGGQPTEQHFWARL 
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FIGURE 156 

GTTCTCCTTTCCGAGCCAAAATCCC^GGCGATGGTGAATTATGAACGTGCCACACCATGAAG 
CTCTTGTGGCAGGTAACTGTGCACCACCACACCTGGAATGCCATCCTGCTCCCGTTCGTCTA 
CCTCACGGCGCAAGTGTGGATTCTGTGTGCAGCCATCGCTGCTGCCGCCTCAGCCGGGCCCC 
AGAACTGCCCCTCCGTTTGCTCGTGCAGTAACCAGTTCAGCAAGGTGGTGTGCACGCGCCGG 
GGCCTCTCCGAGGTCCCGCAGGGTATTCCCTCGAACACCCGGTACCTCAACCTCATGGAGAA 
CAACATCCAGATGATCCAGGCCGACACCTTCCGCCACCTCCACCACCTGGAGGTCCTGCAGT 
TGGGCAGGAACTCCATCCGGCAGATTGAGGTGGGGGCCTTCAACGGCCTGGCCAGCCTCAAC 
ACCCTGGAGCTGTTCGACAACTGGCTGACAGTCATCCCTAGCGGGGCCTTTGAATACCTGTC 
CAAGCTGCGGGAGCTCTGGCTTCGCAACAACCCCATCGAAAGCATCCCCTCTTACGCCTTCA 
ACCGGGTGCCCTCCCTCATGCGCCTGGACTTGGGGGAGCTCAAGAAGCTGGAGTATATCTCT 
GAGGGAGCTTTTGAGGGGCTGTTCAACCTCAAGTATCTGAACTTGGGCATGTGCAACATTAA 
AGACATGCCCAATCTCACCCCCCTGGTGGGGCTGGAGGAGCTGGAGATGTCAGGGAACCACT 
TCCCTGAGATCAGGCCTGGCTCCTTCCATGGCCTGAGCTCCCTCAAGAAGCTCTGGGTCATG 
AACTCACAGGTCAGCCTGATTGAGCGGAATGCTTTTGACGGGCTGGCTTCACTTGTGGAACT 
CAACTTGGCCCACAATAACCTCTCTTCTTTGCCCCATGACCTCTTTACCCCGCTGAGGTACC 
TGGTGGAGTTGCATCTACACCACAACCCTTGGAACTGTGATTGTGACATTCTGTGGCTAGCC 
TGGTGGCTTCGAGAGTATATACCCACCAATTCCACCTGCTGTGGCCGCTGTCATGCTCCCAT 
GCACATGCGAGGCCGCTACCTCGTGGAGGTGGACCAGGCCTCCTTCCAGTGCTCTGCCCCCT 
TCATCATGGACGCACCTCGAGACCTCAACATTTCTGAGGGTCGGATGGCAGAACTTAAGTGT 
CGGACTCCCCCTATGTCCTCCGTGAAGTGGTTGCTGCCCAATGGGACAGTGCTCAGCCACGC 
CTCCCGCCACCCAAGGATCTCTGTCCTCAACGACGGCACCTTGAACTTTTCCCACGTGCTGC 
TTTCAGACACTGGGGTGTACACATGCATGGTGACCAATGTTGCAGGCAACTCCAACGCCTCG 
GCCTACCTCAATGTGAGCACGGCTGAGCTTAACACCTCCAACTACAGCTTCTTCACCACAGT 
AACAGTGGAGACCACGGAGATCTCGCCTGAGGACACAACGCGAAAGTACAAGCCTGTTCCTA 
CCACGTCCACTGGTTACCAGCCGGCATATACCACCTCTACCACGGTGCTCATTCAGACTACC 
CGTGTGCCCAAGCAGGTGGCAGTACCCGCGACAGACACCACTGACAAGATGCAGACCAGCCT 
GGATGAAGTCATGAAGACCACCAAGATCATCATTGGCTGCTTTGTGGCAGTGACTCTGCTAG 
CTGCCGCCATGTTGATTGTCTTCTATAAACTTCGTAAGCGGCACCAGCAGCGGAGTACAGTC 
ACAGCCGCCCGGACTGTTGAGATAATCCAGGTGGACGAAGACATCCCAGCAGCAACATCCGC 
AGCAGCAACAGCAGCTCCGTCCGGTGTATCAGGTGAGGGGGCAGTAGTGCTGCCCACAATTC 
ATGACCATATTAACTACAACACCTACAAACCAGCACATGGGGCCCACTGGACAGAAAACAGC 
CTGGGGAACTCTCTGCACCCCACAGTCACCACTATCTCTGAACCTTATATAATTCAGACCCA 
TACCAAGGACAAGGTACAGGAAACTCAAATATGACTCCCCTCCCCCAAAAAACTTATAAAAT 
GCAATAGAATGCACACAAAGACAGCAACTTTTGTACAGAGTGGGGAGAGACTTTTTCTTGTA 
TATGCTTATATATTAAGTCTATGGGCTGGTTAAAAAAAACAGATTATATTAAAATTTAAAGA 
CAAAAAGTCAAAACA 
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FIGURE 157 

MKLLWQVTVHHHTmAILLPFVYLTAQWILCAAIAAAASAGPQNCPSVCSCSNQFSKVVCT 

RRGLSEVPQGIPSNTRYLNLMENNIQMIQADTFRHLHHLEVLQLGRNSIRQIEVGAFNGLAS 

LNTLELFDNWLWIPSGAFEYLSKLRELWLRNNPIESIPSYAFNRVPSLMRLDLGELKKLEY 

ISEGAFEGLFNLKYLNLGMCNIKDMPNLTPLVGLEELEMSGNHFPEIRPGSFHGLSSLKKLW 

VmSQVSLIERNAFDGLASLVELNLAHNNLSSLPHDLFTPLRYLVEI^ 

LAWWLREYIPTNSTCCGRCHAPMH^GRYLVEVDQASFQCSAPFIMDAPRDLNISEGRMAEL 

KCRTPPMSSVKWLLPNGTVLSHASRHPRISVLNDGTLNFSHVLLSDTGWTCIWTNVAGNSN 

ASAYLNVSTAELNTSNYSFFTTVTVETTEISPEDTTRKYKPVPTTSTGYQPAYTTSTTVLIQ 

TTRVPKQVAVPATDTTDKMQTSLDEVMKTTKI I IGCFVAVTLLAAAMLIVFYKLRKRHQQRS 

TVTAARTVE I IQVDED I PAATSAAATAAPSGVSGEGAWLPT IHDH INYNTYKPAHGAHWTE 

NSLGNSLHPTVTTISEPYI I QTHTKDKVQETQ I 
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FIGURE 158 

CGCTCGGGCACCAGCCGCGGCAAGGATGGAGCTGGGTTGCTGGACGCAGTTGGGGCTCACTT 

TTCTTCAGCTCCTTCTCATCTCGTCCTTGCCAAGAGAGTACACAGTCATTAATGAAGCCTGC 

CCTGGAGCAGAGTGGAATATCATGTGTCGGGAGTGCTGTGAATATGATCAGATTGAGTGCGT 

CTGCCCCGGAAAGAGGGAAGTCGTGGGTTATACCATCCCTTGCTGCAGGAATGAGGAGAATG 

AGTGTGACTCCTGCCTGATCCACCCAGGTTGTACCATCTTTGAAAACTGCAAGAGCTGCCGA 

AATGGCTCATGGGGGGGTACCTTGGATGACTTCTATGTGAAGGGGTTCTACTGTGCAGAGTG 

CCGAGCAGGCTGGTACGGAGGAGACTGCATGCGATGTGGCCAGGTTCTGCGAGCCCCAAAGG 

GTCAGATTTTGTTGGAAAGCTATCCCCTAAATGCTCACTGTGAATGGACCATTCATGCTAAA 

CCTGGGTTTGTCATCCAACTAAGATTTGTCATGTTGAGTCTGGAGTTTGACTACATGTGCCA 

GTATGACTATGTTGAGGTTCGTGATGGAGACAACCGCGATGGCCAGATCATCAAGCGTGTCT 

GTGGCAACGAGCGGCCAGCTCCTATCCAGAGCATAGGATCCTCACTCCACGTCCTCTTCCAC 

TCCGATGGCTCCAAGAATTTTGACGGTTTCCATGCCATTTATGAGGAGATCACAGCATGCTC 

CTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTG 

CCTGCTTGGCAGGCTATACTGGGCAGCGCTGTGAAAATCTCCTTGAAGAAAGAAACTGCTCA 

GACCCTGGGGGCCCAGTCAATGGGTACCAGAAAATAACAGGGGGCCCTGGGCTTATCAACGG 

ACGCCATGCTAAAATTGGCACCGTGGTGTCTTTCTTTTGTAACAACTCCTATGTTCTTAGTG 

GCAATGAGAAAAGAACTTGCCAGCAGAATGGAGAGTGGTCAGGGAAACAGCCCATCTGCATA 

AAAGCCTGCCGAGAACCAAAGATTTCAGACCTGGTGAGAAGGAGAGTTCTTCCGATGCAGGT 

TCAGTCAAGGGAGACACCATTACACCAGCTATACTCAGCGGCCTTCAGCAAGCAGAAACTGC 

AGAGTGCCCCTACCAAGAAGCCAGCCCTTCCCTTTGGAGATCTGCCCATGGGATACCAACAT 

CTGCATACCCAGCTCCAGTATGAGTGCATCTCACCCTTCTACCGCCGCCTGGGCAGCAGCAG 

GAGGACATGTCTGAGGACTGGGAAGTGGAGTGGGCGGGCACCATCCTGCATCCCTATCTGCG 

GGAAAATTGAGAACATCACTGCTCCAAAGACCCAAGGGTTGCGCTGGCCGTGGCAGGCAGCC 

ATCTACAGGAGGACCAGCGGGGTGCATGACGGCAGCCTACACAAGGGAGCGTGGTTCCTAGT 

CTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACC 

TGGGGAAGGTCACCATGATCAAGACAGCAGACCTGAAAGTTGTTTTGGGGAAATTCTACCGG 

GATGATGACCGGGATGAGAAGACCATCCAGAGCCTACAGATTTCTGCTATCATTCTGCATCC 

CAACTATGACCCCATCCTGCTTGATGCTGACATCGCCATCCTGAAGCTCCTAGACAAGGCCC 

GTATCAGCACCCGAGTCCAGCCCATCTGCCTCGCTGCCAGTCGGGATCTCAGCACTTCCTTC 

CAGGAGTCCCACATCACTGTGGCTGGCTGGAATGTCCTGGCAGACGTGAGGAGCCCTGGCTT 

CAAGAACGACACACTGCGCTCTGGGGTGGTCAGTGTGGTGGACTCGCTGCTGTGTGAGGAGC 

AGCATGAGGACCATGGCATCCCAGTGAGTGTCACTGATAACATGTTCTGTGCCAGCTGGGAA 

CCCACTGCCCCTTCTGATATCTGCACTGCAGAGACAGGAGGCATCGCGGCTGTGTCCTTCCC 

GGGACGAGCATCTCCTGAGCCACGCTGGCATCTGATGGGACTGGTCAGCTGGAGCTATGATA 

AAACATGCAGCCACAGGCTCTCCACTGCCTTCACCAAGGTGCTGCCTTTTAAAGACTGGATT 

GAAAGAAATATGAAATGAACCATGCTCATGCACTCCTTGAGAAGTGTTTCTGTATATCCGTC 

TGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGG 

CTGTGCCAGGGCTTCTGACTTCAGGGACAAAACTCAGTGAAGGGTGAGTAGACCTCCATTGC 

TGGTAGGCTGATGCCGCGTCCACTACTAGGACAGCCAATTGGAAGATGCCAGGGCTTGCAAG 

AAGTAAGTTTCTTCAAAGAAGACCATATACAAAACCTCTCCACTCCACTGACCTGGTGGTCT 

TCCCCAACTTTCAGTTATACGAATGCCATCAGCTTGACCAGGGAAGATCTGGGCTTCATGAG 

GCCCCTTTTGAGGCTCTCAAGTTCTAGAGAGCTGCCTGTGGGACAGCCCAGGGCAGCAGAGC 

TGGGATGTGGTGCATGCCTTTGTGTACATGGCCACAGTACAGTCTGGTCCTTTTCCTTCCCC 

ATCTCTTGTACACATTTTAATAAAATAAGGGTTGGCTTCTGAACTACAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAT^AAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 159 

MELGCWTQLGLTFLQLLLISSLPREYWINEACPGAEWNIMCRECCEYDQIECVCPGKREVV 
GYTIPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGD 
G^CGQVLRAPKGQILLESYPLNAHCEWTI 

GDNRDGQIIKRVCGNERPAPIQSIGSSLHVLFHSDGSKNFDGFHAIYEEITACSSSPCFHDG 
TCVLDKAGSYKCACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTV 
VSFFCNNS YVLSGNEKRTCQQNGEWSGKQP I C I KACRE PKI SDLVRRRVLPMQVQSRETPLH 
QLYSAAFSKQKLQSAPTKKPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGK 
WSGRAPSCIPICGKIENITAPKTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVNE 
RT VWAAH C VTDLGKVTM I KTADLKWLGKF YRDDDRDEKT I QS LQ I S AI I LHPNYD P I LLD 
ADIAILKLLDKARISTRVQPICI^SRDLSTSFQESHITVAGWNVLADVRSPGFKNDTLRSG 
WSVVDSLLCEEQHEDHGIPVSVTDNMFCASWEPTAPSDICTAETGGIAAVSFPGRASPEPR 
WHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWIERNMK 
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ACCAGGCATTGTATCTTCAGTTGTCATCAAGTTCGCAATCAGATTGGAAAAGCTCAACTTGA 
AGCTTTCTTGCCTGCAGTGAAGCAGAGAGATAGATATTATTCACGTAATAAAAAACATGGGC 
TTCAACCTGACTTTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCT 
GACAGTGGTTGGGTGGGCCACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAG 
CAAAGGAGTTCATGGCTAATTTCCATAAGACCCTCATTTTGGGGAAGGGAAAAACTCTGACT 
AATGAAGCATCCACGAAGAAGGTAGAACTTGACAACTGTCCTTCTGTGTCTCCTTACCTCAG 
AGGCCAGAGCAAGCTCATTTTCAAACCAGATCTC^CTTTGGAAGAGGTACAGGCAGAAAATC 
CCAAAGTGTCCAGAGGCCGGTATCGCCCTC^GGAATGTAAAGCTTTA 

GTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAACATCTGCATCCCTTCCT 
GCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGGTAAAAAGTTTA 
ATCGAGCCAAACTCTTGAATGTGGGCTATCTAGAAGCCCTCAAGGAAGAAAATTGGGACTGC 
TTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGA 
GCATCCCT^GCATCTGGTGGTTGGCAGGAACAGCACTGGGTACAGGTTACGTTACAGTGGAT 
ATTTTGGGGGTGTTACTGCCCTAAGCAGAGi^GCAGTTTTTCAAGGTGAATGGATTCTCTAAC 
AACTACTGGGGATGGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAAT 
GAAAATTTCCCGGCCCCTGCCTGAAGTGGGTAAATATACAATGGTCTTCCACACTAGAGACA 
AAGGCAATGAGGTGAACGCAGAACGGATGAAGCTCTTACACCAAGTGTCACGAGTCTGGAGA 
ACAGATGGGTTGAGTAGTTGTTCTTATAAATTAGTATCTGTGGAACACAATCCTTTATATAT 
CAACATCACAGTGGATTTCTGGTTTGGTGCATGACCCTGGATCTTTTGGTGATGTTTGGAAG 
AACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAAATAGTAGCACACATTAAGA 
ACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCTTAGCAGAGCTCCT 
GGTGATGTAGAGTATAAAACAGTTGTAACAAGACAGCTTTCTTAGTCATTTTGATCATGAGG 
GTTAAATATTGTAATATGGATACTTGAAGGACTTTATATAAAAGGATGACTCAAAGGATAAA 
ATGAACGCTATTTGAGGACTCTGGTTGAAGGAGATTTATTTAAATTTGAAGTAATATATTAT 
GGGATAAAAGGCCACAGGAAATAAGACTGCTGAATGTCTGAGAGAACCAGAGTTGTTCTCGT 
CCAAGGTAGAAAGGTACGAAGATACAATACTGTTATTCATTTATCCTGTACAATCATCTGTG 
AAGTGGTGGTGTCAGGTGAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAATCAGGA 
CACAGTGAACTTGGGAATGAAGAGGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAG 
TAGCTGAGCTGGTTGCAGGTGCTGATAGCCTTCAGGGGAGGACCTGCCCAGGTATGCCTTCC 
AGTGATGCCCACCAGAGAATACATTCTCTATTAGTTTTTAAAGAGTTTTTGTAAAATGATTT 
TGTACAAGTAGGATATGAATTAGCAGTTTACAAGTTTACATATTAACTAATAATAAATATGT 
CTATCAAATACCTCTGTAGTAAAATGTGAAAAAGCAAAA 
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FIGURE 161 

MG FNLTFHL S YKFRLLLLLTLCLTWGW ATSNYF VG A I QE I P KAKE FMANFHKTL I LG KGKT 
LTNEASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRV 
AILVPHRNREKHLMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKFNRAKLLNVGYLEALKEEN 
WDCFIFHDVDLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNG 
FSNNYWGWGGEDDDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSR 
VWRTDGLSSCSYKLVSVEHNPLYINITVDFWFGA 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites: 

amino acids 4-7, 220-223 and 335-338 



Xylose isomerase proteins: 

amino acids 191-201 
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CGTGGGCCGGGGTCGCGCAGCGGGCTGTGGGCGCGCCCGGAGGAGCGACCGCCGCAGTTCTC 
GAGCTCCAGCTGCATTCCCTCCGCGTCCGCCCCACGCTTCTCCCGCTCCGGGCCCCGCAATG 
GCCCAGGCAGTGTGGTCGCGCCTCGGCCGCATCCTCTGGCTTGCCTGCCTCCTGCCCTGGGC 
CCCGGCAGGGGTGGCCGCAGGCCTGTATGAACTCAATCTCACCACCGATAGCCCTGCCACCA 
CGGGAGCGGTGGTGACCATCTCGGCCAGCCTGGTGGCCAAGGACAACGGCAGCCTGGCCCTG 
CCCGCTGACGCCCACCTCTACCGCTTCCACTGGATCCACACCCCGCTGGTGCTTACTGGCAA 
GATGGAGAAGGGTCTCAGCTCCACCATCCGTGTGGTCGGCCACGTGCCCGGGGAATTCCCGG 
TCTCTGTCTGGGTCACTGCCGCTGACTGCTGGATGTGCCAGCCTGTGGCCAGGGGCTTTGTG 
GTCCTCCCCATCACAGAGTTCCTCGTGGGGGACCTTGTTGTCACCCAGAACACTTCCCTACC 
CTGGCCCAGCTCCTATCTCACTAAGACCGTCCTGAAAGTCTCCTTCCTCCTCCACGACCCGA 
GCAACTTCCTCAAGACCGCCTTGTTTCTCTACAGCTGGGACTTCGGGGACGGGACCCAGATG 
GTGACTGAAGACTCCGTGGTCTATTATAACTATTCCATCATCGGGACCTTCACCGTGAAGCT 
CAAAGTGGTGGCGGAGTGGGAAGAGGTGGAGCCGGATGCCACGAGGGCTGTGAAGCAGAAGA 
CCGGGGACTTCTCCGCCTCGCTGAAGCTGCAGGAAACCCTTCGAGGCATCCAAGTGTTGGGG 
CCCACCCTAATTCAGACCTTCCAAAAGATGACCGTGACCTTGAACTTCCTGGGGAGCCCTCC 
TCTGACTGTGTGCTGGCGTCTCAAGCCTGAGTGCCTCCCGCTGGAGGAAGGGGAGTGCCACC 
CTGTGTCCGTGGCCAGCACAGCGTACAACCTGACCCACACCTTCAGGGACCCTGGGGACTAC 
TGCTTCAGCATCCGGGCCGAGAATATCATCAGCAAGACACATCAGTACCACAAGATCCAGGT 
GTGGCCCTCCAGAATCCAGCCGGCTGTCTTTGCTTTCCCATGTGCTACACTTATCACTGTGA 
TGTTGGCCTTCATCATGTACATGACCCTGCGGAATGCCACTCAGCAAAAGGACATGGTGGAG 
AACCCGGAGCCACCCTCTGGGGTCAGGTGCTGCTGCCAGATGTGCTGTGGGCCTTTCTTGCT 
GGAGACTCCATCTGAGTACCTGGAAATTGTTCGTGAGAACCACGGGCTGCTCCCGCCCCTCT 
ATAAGTCTGTCAAAACTTACACCGTGTGAGCACTCCCCCTCCCCACCCCATCTCAGTGTTAA 
CTGACTGCTGACTTGGAGTTTCCAGCAGGGTGGTGTGCACCACTGACCAGGAGGGGTTCATT 
TGCGTGGGGCTGTTGGCCTGGATCATCCATCCATCTGTACAGTTCAGCCACTGCCACAAGCC 
CCTCCCTCTCTGTCACCCCTGACCCCAGCCATTCACCCATCTGTACAGTCCAGCCACTGACA 
TAAGCCCCACTCGGTTACCACCCCCTTGACCCCCTACCTTTGAAGAGGCTTCGTGCAGGACT 
TTGATGCTTGGGGTGTTCCGTGTTGACTCCTAGGTGGGCCTGGCTGCCCACTGCCCATTCCT 
CTCATATTGGCACATCTGCTGTCCATTGGGGGTTCTCAGTTTCCTCCCCCAGACAGCCCTAC 
CTGTGCCAGAGAGCTAGAAAGAAGGTCATAAAGGGTTAAAAATCCATAACTAAAGGTTGTAC 
ACATAGATGGGCACACTCACAGAGAGAAGTGTGCATGTACACACACCACACACACACACACA 
CACACACACACAGAAATATAAACACATGCGTCACATGGGCATTTCAGATGATCAGCTCTGTA 
TCTGGTTAAGTCGGTTGCTGGGATGCACCCTGCACTAGAGCTGAAAGGAAATTTGACCTCCA 
AGCAGCCCTGACAGGTTCTGGGCCCGGGCCCTCCCTTTGTGCTTTGTCTCTGCAGTTCTTGC 
GCCCTTTATAAGGCCATCCTAGTCCCTGCTGGCTGGCAGGGGCCTGGATGGGGGGCAGGACT 
AATACTGAGTGATTGCAGAGTGCTTTATAAATATCACCTTATTTTATCGAAACCCATCTGTG 
AAACTTTCACTGAGGAAAAGGCCTTGCAGCGGTAGAAGAGGTTGAGTCAAGGCCGGGCGCGG 
TGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACGAGATCAGGA 
GATCGAGACCACCCTGGCTAACACGGTGAAACCCCGTCTCTACTAAAAAAATACAAAAAGTT 
AGCCGGGCGTGGTGGTGGGTGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATG 
GTGCGAACCCGGGAGGCGGAGCTTGCAGTGAGCCCAGATGGCGCCACTGCACTCCAGCCTGA 
GTGACAGAGCGAGACTCTGTCTCCA 
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MAQAWSRLGRILWLACLLPWAPAGVAAGLYELNLTTDSPATTGAVVTISASLVAKDNGSLA 
LPADAHLYRFHWIHTPLVLTGKMEKGLSSTIRWGHVPGEFPVSVWWAADCWMCQPVARGF 
VVLPITEFLVGDLVVTQNTSLPWPSSYLTKTVLKVSFLLHDPSNFLKTALFLYSWDFGDGTQ 
MVTEDSVVYYNYSIIGTFTVKLKWAEWEEVEPDATRAVKQKTGDFSASLKLQETLRGIQVL 
GPTLIQTFQKMTWLNFLGSPPLTVCWRLKPECLPLEEGECHPVSVASTAYNLTHTFRDPGD 
YCFSIRAENIISKTHQYHKIQWPSRIQPAVFAFPCATLITVMIAFIMYMTLRNATQQKDW 
ENPEPPSGVRCCCQMCCGPFLLETPSEYLEIVRENHGLLPPLYKSVKTYTV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domain: 

amino acids 33 9-3 62 

N-glycosylation sites. 

amino acids 34-37, 58-61, 142-145, 197-200, 300-303 and 364-367 
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FIGURE 164 

GCTCAAGACCCAGCAGTGGGACAGCCAGACAGACGGCACGATGGCACTGAGCTCCCAGATCT 

GGGCCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCA 

CAACAGACGGGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTG 

GATGCCCATGTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCT 

GCGGCTGCTGTCATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTG 

CCCCCGTCCCCTCCCTTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAA 

TGGCTGGTTCTTTTGTTTTCCT^AAAAAAAAAAAAAAAAAAAAAAAAAAA^^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 165 



MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTH 
FPICIFCCGCCHRSKCGMCCKT 
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CTGTCAGGAAGGACCATCTGAAGGCTGCAATTTGTTCTTAGGGAGGCAGGTGCTGGCCTGGC 

CTGGATCTTCCACCATGTTCCTGTTGCTGCCTTTTGATAGCCTGATTGTCAACCTTCTGGGC 

ATCTCCCTGACTGTCCTCTTCACCCTCCTTCTCGTTTTCATCATAGTGCCAGCCATTTTTGG 

AGTCTCCTTTGGTATCCGCAAACTCTACATGAAAAGTCTGTTAAAAATCTTTGCGTGGGCTA 

CCTTGAGAATGGAGCGAGGAGCCAAGGAGAAGAACCACCAGCTTTACAAGCCCTACACCAAC 

GGAATCATTGCAAAGGATCCCACTTCACTAGAAGAAGAGATCAAAGAGATTCGTCGAAGTGG 

TAGTAGTAAGGCTCTGGACAACACTCCAGAGTTCGAGCTCTCTGACATTTTCTACTTTTGCC 

GGAAAGGAATGGAGACCATTATGGATGATGAGGTGACAAAGAGATTCTCAGCAGAAGAACTG 

GAGTCCTGGAACCTGCTGAGCAGAACCAATTATAACTTCCAGTACATCAGCCTTCGGCTCAC 

GGTCCTGTGGGGGTTAGGAGTGCTGATTCGGTACTGCTTTCTGCTGCCGCTCAGGATAGCAC 

TGGCTTTCACAGGGATTAGCCTTCTGGTGGTGGGCACAACTGTGGTGGGATACTTGCCAAAT 

GGGAGGTTTAAGGAATTCATGAGTAAACATGTTCACTTAATGTGTTACCGGATCTGCGTGCG 

AGCGCTGACAGCCATCATCACCTACCATGACAGGGAAAACAGACCAAGAAATGGTGGCATCT 

GTGTGGCCAATCATACCTCACCGATCGATGTGATCATCTTGGCCAGCGATGGCTATTATGCC 

ATGGTGGGTCAAGTGCACGGGGGACTCATGGGTGTGATTCAGAGAGCCATGGTGAAGGCCTG 

CCCACACGTCTGGTTTGAGCGCTCGGAAGTGAAGGATCGCCACCTGGTGGCTAAGAGACTGA 

CTGAACATGTGCAAGATAAAAGCAAGCTGCCTATCCTCATCTTCCCAGAAGGAACCTGCATC 

AATAATACATCGGTGATGATGTTCAAAAAGGGAAGTTTTGAAATTGGAGCCACAGTTTACCC 

TGTTGCTATCAAGTATGACCCTCAATTTGGCGATGCCTTCTGGAACAGCAGCAAATACGGGA 

TGGTGACGTACCTGCTGCGAATGATGACCAGCTGGGCCATTGTCTGCAGCGTGTGGTACCTG 

CCTCCCATGACTAGAGAGGCAGATGAAGATGCTGTCCAGTTTGCGAATAGGGTGAAATCTGC 

CATTGCCAGGCAGGGAGGACTTGTGGACCTGCTGTGGGATGGGGGCCTGAAGAGGGAGAAGG 

TGAAGGACACGTTCAAGGAGGAGCAGCAGAAGCTGTACAGCAAGATGATCGTGGGGAACCAC 

AAGGACAGGAGCCGCTCCTGAGCCTGCCTCCAGCTGGCTGGGGCCACCGTGCGGGGTGCCAA 

CGGGCTCAGAGCTGGAGTTGCCGCCGCCGCCCCCACTGCTGTGTCCTTTCCAGACTCCAGGG 

CTCCCCGGGCTGCTCTGGATCCCAGGACTCCGGCTTTCGCCGAGCCGCAGCGGGATCCCTGT 

GCACCCGGCGCAGCCTACCCTTGGTGGTCTAAACGGATGCTGCTGGGTGTTGCGACCCAGGA 

CGAGATGCCTTGTTTCTTTTACAATAAGTCGTTGGAGGAATGCCATTAAAGTGAACTCCCCA 

CCTTTGCACGCTGTGCGGGCTGAGTGGTTGGGGAGATGTGGCCATGGTCTTGTGCTAGAGAT 

GGCGGTACAAGAGTCTGTTATGCAAGCCCGTGTGCCAGGGATGTGCTGGGGGCGGCCACCCG 

CTCTCCAGGAAAGGCACAGCTGAGGCACTGTGGCTGGCTTCGGCCTCAACATCGCCCCCAGC 

CTTGGAGCTCTGCAGACATGATAGGAAGGAAACTGTCATCTGCAGGGGCTTTCAGCAAAATG 

AAGGGTTAGATTTTTATGCTGCTGCTGATGGGGTTACTAAAGGGAGGGGAAGAGGCCAGGTG 

GGCCGCTGACTGGGCCATGGGGAGAACGTGTGTTCGTACTCCAGGCTAACCCTGAACTCCCC 

ATGTGATGCGCGCTTTGTTGAATGTGTGTCTCGGTTTCCCCATCTGTAATATGAGTCGGGGG 

GAATGGTGGTGATTCCTACCTCACAGGGCTGTTGTGGGGATTAAAGTGCTGCGGGTGAGTGA 

AGGACACATCACGTTCAGTGTTTCAAGTACAGGCCCACAAAACGGGGCACGGCAGGCCTGAG 

CTCAGAGCTGCTGCACTGGGCTTTGGATTTGTTCTTGTGAGTAAATAAAACTGGCTGGTGAATGA 
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FIGURE 167 

MFLLLPFDSLI VNLLG I SLTVLFTLLLVFI I VPA I FGVSFG I RKLYMKSLLK I FAWATLRME 
RGAKEKNHQLYKPYTNGIIAKDPTSLEEEIKEIRRSGSSKALDNTPEFELSDIFYFCRKGME 
TIMDDEWKRFSAEELESWNLLSRTNYNFQYISLRLTVLWGLGVLIRYCFLLPLRIALAFTG 
I SLLWGTT WG YLPNGRFKEFMS KHVHLMCYR I CVRALTAI I TYHDRENRPRNGG I CVANH 
TSPIDVIILASDGYYAMVGQVHGGLMGVIQRAMVKACPHVWFERSEVKDRHLVAKRLTEHVQ 
DKSKLPILIFPEGTCINNTSVl^FKKGSFEIGATWPVAIKYDPQFGDAFWNSSKYGMVTYL 
LRMMTSWAIVCSWYLPPMTREADEDAVQFANRVKSAIARQGGLVDLLWDGGLKREKVKDTF 
KEEQQKLYSKMIVGNHKDRSRS 
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FIGURE 168 

GCCCCTCGAAACCAGGACTCCAGCACCTCTGGTCCCGCCCTCACCCGGACCCCTGGCCCTCA 
CGTCTCCTCCAGGGATGGCGCTGGCGGCTTTGATGATCGCCCTCGGCAGCCTCGGCCTCCAC 
ACCTGGCAGGCCCAGGCTGTTCCCACCATCCTGCCCCTGGGCCTGGCTCCAGACACCTTTGA 
CGATACCTATGTGGGTTGTGCAGAGGAGATGGAGGAGAAGGCAGCCCCCCTGCTAAAGGAGG 
AAATGGCCCACCATGCCCTGCTGCGGGAATCCTGGGAGGCAGCCCAGGAGACCTGGGAGGAC 
AAGCGTCGAGGGCTTACCTTGCCCCCTGGCTTCAAAGCCCAGAATGGAATAGCCATTATGGT 
CTACACCAACTCATCGAACACCTTGTACTGGGAGTTGAATCAGGCCGTGCGGACGGGCGGAG 
GCTCCCGGGAGCTCTACATGAGGCACTTTCCCTTCAAGGCCCTGCATTTCTACCTGATCCGG 
GCCCTGCAGCTGCTGCGAGGCAGTGGGGGCTGCAGCAGGGGACCTGGGGAGGTGGTGTTCCG 
AGGTGTGGGCAGCCTTCGCTTTGAACCCAAGAGGCTGGGGGACTCTGTCCGCTTGGGCCAGT 
TTGCCTCCAGCTCCCTGGATAAGGCAGTGGCCCACAGATTTGGGGAGAAGAGGCGGGGCTGT 
GTGTCTGCGCCAGGGGTGCAGCTAGGGTCACAATCTGAGGGGGCCTCCTCTCTGCCCCCCTG 
GAAGACTCTGCTCTTGGCCCCTGGAGAGTTCCAGCTCTCAGGGGTTGGGCCCTGAAAGTCCA 
ACATCTGCCACTTAGGAGCCCTGGGAACGGGTGACCTTCATATGACGAAGAGGCACCTCCAG 
CAGCCTTGAGAAGCAAGAACATGGTTCCGGACCCAGCCCTAGCAGCCTTCTCCCCAACCAGG 
ATGTTGGCCTGGGGAGGCCACAGCAGGGCTGAGGGAACTCTGCTATGTGATGGGGACTTCCT 
GGGACAAGCAAGGAAAGTACTGAGGCAGCCACTTGATTGAACGGTGTTGCAATGTGGAGACA 
TGGAGTTTTATTGAGGTAGCTACGTGATTAAATGGTATTGCAGTGTGGA 
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FIGURE 169 

MALAALMI ALGS LGLHTWQAQAVPT I LPLGLAPDTFDDTYVGCAEEMEEKAAPLLKEEMAHH 
ALLRESWEAAQETWEDKRRGLTLPPGFKAQNGIAIIWYTNSSNTLYWELNQAVRTGGGSREL 
YMRHFPFKALHFYLIRALQLLRGSGGCSRGPGEWFRGVGSLRFEPKRLGDSVRLGQFASSS 
LDKAVAHRFGEKRRGCVSAPGVQLGSQSEGASSLPPWKTLLLAPGEFQLSGVGP 
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FIGURE 170 

GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAATATGGCTGGTTCCCCAACATGCCTCA 
CCCTCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTG 
GTCGGTTCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTC 
TATTGTCTGGACCTTCAACACAACCCCTCTTGTCACCATACAGCCAGAAGGGGGCACTATCA 
TAGTGACCCAAAATCGTAATAGGGAGAGAGTAGACTTCCCAGATGGAGGCTACTCCCTGAAG 
CTCAGCAAACTGAAGAAGAATGACTCAGGGATCTACTATGTGGGGATATACAGCTCATCACT 
CCAGCAGCCCTCCACCCAGGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAG 
TCACCATGGGTCTGCAGAGCAATAAGAATGGCACCTGTGTGACCAATCTGACATGCTGCATG 
GAACATGGGGAAGAGGATGTGATTTATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTC 
CCATAATGGGTCCATCCTCCCCATCTCCTGGAGATGGGGAGAAAGTGATATGACCTTCATCT 
GCGTTGCCAGGAACCCTGTCAGCAGAAACTTCTCAAGCCCCATCCTTGCCAGGAAGCTCTGT 
GAAGGTGCTGCTGATGACCCAGATTCCTCCATGGTCCTCCTGTGTCTCCTGTTGGTGCCCCT 
CCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTGGTTTCTGAAGAGAGAGAGACAAGAAG 
AGTACATTGAAGAGAAGAAGAGAGTGGACATTTGTCGGGAAACTCCTAACATATGCCGCCAT 
TCTGGAGAGAACACAGAGTACGACACAATCCCTCACACTAATAGAACAATCCTAAAGGAAGA 
TCCAGCAAATACGGTTTACTCCACTGTGGAAATACCGAAAAAGATGGAAAATCCCCACTCAC 
TGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTATCTAGACAGCAGTG 
CACTCCCCTAAGTCTCTGCTCA 
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FIGURE 171 

MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVT 
IQPEGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHV 
YEHLSKPKVTMGLQSNKNGTCVTNLTCCMEHGEEDV I YTWKALGQAANESHNGS I LP I SWRW 
GESDMTFICVARNPVSRNFSSPILARKLCEGAADDPDSSIWLLCLLLVPLLLSLFVLGLFLW 
FLKRERQEEYIEEKKRVDICRETPNICPHSGENTEYDTIPHTNRTILKEDPANTVYSTVEIP 
KKMENPHSLLTMPDTPRLFAYENVI 
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FIGURE 172 

CTGGTTCCCCAACATGCCTCACCCTCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCC 
TCTGGACCCGTGAAAGAGCTGGTCGGTTCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTC 
CAAAGTAAAGCAAGTTGACTCTATTGTCTGGACCTTCAACACAACCCCTCTTGTCACCATAC 
AGCCAGAAGGGGGCACTATCATAGTGACCCAAAATCGTAATAGGGAGAGAGTAGACTTCCCA 
GATGGAGGCTACTCCCTGAAGCTCAGCAAACTGAAGAAGAATGACTCAGGGATCTACTATGT 
GGGGATATACAGCTCATCACTCCAGCAGCCCTCCACCCAGGAGTACGTGCTGCATGTCTACG 
AGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCAATAAGAATGGCACCTGTGTG 
ACCAATCTGACATGCTGCATGGAACATGGGGAAGAGGATGTGATTTATACCTGGAAGGCCCT 
GGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTCCTGGAGATGGGGAG 
AAAGTGATATGACCTTCATCTGCGTTGCCAGGAACCCTGTCAGCAGAAACTTCTCAAGCCCC 
ATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATGGTCCTCCT 
GTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTGGTTTC 
TGAAGAGAGAGAGACAAGAAGAGTACATTGAAGAGAAGAAGAGAGTGGACATTTGTCGGGAA 
ACTCCTAACATATGCCCCCATTCTGGAGAGAACACAGAGTACGACACAATCCCTCACACTAA 
TAGAACAATCCTAAAGGAAGATCCAGCAAATACGGTTTACTCCACTGTGGAAATACCGAAAA 
AGATGGAAAATCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAG 
AATGTTATCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCAAAAAAAAAAAAAAAAAAA 
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FIGURE 173 

GAAAQACQTGGTCCTGACAQACAGACAATCCTATTCCCTACCAA AATG AAGATGCTGCTGCT 
GCTGTGTTTGGGACTGACCCTAGTCTGTGTCCATGCAGAAGAAGCTAGTTCTACGGGAAGGA 
ACTTTAATGTAGAAAAGATTAATGGGGAATGGCATACTATTATCCTGGCCTCTGACAAAAGA 
GAAAAGATAGAAGAACATGGCAACTTTAGACTTTTTCTGGAGCAAATCCATGTCTTGGAGAA 
TTCCTTAGTTCTTAAAGTCCATACTGTAAGAGATGAAGAGTGCTCCGAATTATCTATGGTTG 
CTGACAAAACAGAAAAGGCTGGTGAATATTCTGTGACGTATGATGGATTCAATACATTTACT 
ATACCTAAGACAGACTATGATAACTTTCTTATGGCTCACCTCATTAACGAAAAGGATGGGGA 
AACCTTCCAGCTGATGGGGCTCTATGGCCGAGAACCAGATTTGAGTTCAGACATCAAGGAAA 
GGTTTGCACAACTATGTGAGGAGCATGGAATCCTTAGAGAAAATATCATTGACCTATCCAAT 
GCCAATCGCTGCCTCCAGGCCCGAGAATGAAGAATGGCCTGAGCCTCCAGTGTTGAGTGGAC 
ACTTCTCACCAGGACTCCACCATCATCCCTTCCTATCCATACAGCATCCCCAGTATAAATTC 
TGTGATCTGCATTCCATCCTGTCTCACTGAGAAGTCCAATTCCAGTCTATCAACATGTTACC 
TAGGATACCTCATCAAGAATCAAAGACTTCTTTAAATTTCTCTTTGATACACCCTTGACAAT 
TTTTCATGAAATTATTCCTCTTCCTGTTCAATAAATGATTACCCTTGCACTTAA 
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FIGURE 174 

MKMLLLLCLGLTLVCVHAEEASSTGRNFNVEKINGEWHTI ILASDKREKIEEHGNFRLFLEQ 
IHVLENSLVLKVHTVRDEECSELSIWADKT^ 

NEKDGETFQLMGLYGREPDLSSDIKERFAQLCEEHGILRENIIDLSNANRCLQARE 
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FIGURE 175 



GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAA 
TGGATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTG 
TCAGCTTAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTC 
CCAGGAATTATAGGAGCAGGTCTGATGGCCATTCCAGCAACAACAATGTCCTTGACAGCAAG 
AAAAAGAGCGTGCTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCA 
CAGTCATTGGTGCTCTGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTC 
ATGTGTAATTCTCCAAGCAACAGTAATGCCAATTGTGAATTTTCATTGAAAAACATCAGTGA 
CATTCATCCAGAATCCTTCAACTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTG 
GTTT CAAT AAAC CC AC CAGT AACG AC AC CATGGCGAGTGGCTGG AGAGC ATCTAGTTT CC AC 
TTCGATTCTGAAGAAAACAAACATAGGCTTATCCACTTCTCAGTATTTTTAGGTCTATTGCT 
TGTTGGAATTCTGGAGGTCCTGTTTGGGCTCAGTCAGATAGTCATCGGTTTCCTTGGCTGTC 
TGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTGTGTAGTTTAATGGGAATAAAATGTAAGTA 
TCAGTAGTTTGAAAAAAAAAAA 
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FIGURE 176 

MTCCEGWTSCNGFSLLVLLLLGVVLNAIPLIVSLVEEDQFSQNPISCFEVWFPGIIGAGLl^ 
I PATTMSLTARKRACCNNRTGMFLSS FFS VITVI GALYCMLI S I QALLKGPLMCNS PSNSNA 
NCEFSLKNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTI^SGWRASSFHFDSEENKHRL 
IHFSVFLGLLLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 
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FIGURE 177 



PCT/US99/12252 



GTCGAATCCAAATCACTCATTGTGAAAGCTGAGCTCACAGCCGAATAAGCCACCATGAGGCT 
GTCAGTGTGTCTCCTGATGGTCTCGCTGGCCCTTTGCTGCTACCAGGCCCATGCTCTTGTCT 
GCCCAGCTGTTGCTTCTGAGATCACAGTCTTCTTATTCTTAAGTGACGCTGCGGTAAACCTC 
CAAGTTGCCAAACTTAATCCACCTCCAGAAGCTCTTGCAGCCAAGTTGGAAGTGAAGCACTG 
CACCGATCAGATATCTTTTAAGAAACGACTCTCATTGAAAAAGTCCTGGTGGAAATAGTGAA 
AAAATGTGGTGTGTGACATGTAAAAATGCTCAACCTGGTTTCCAAAGTCTTTCAACGACACC 
CTGATCTTCACTAAAAATTGTAAAGGTTTCAACACGTTGCTTTAATAAATCACTTGCCCTGC 
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FIGURE 178 

MRLSVCLLIWSLALCCYQAHALVC^ 
KHCTDQ I S FKKRLSLKKS WWK 
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FIGURE 179 

ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGG 
TGAAGGAGCAGTGAGGAACCTGCAGAGTCACACAGTTGCTGACCAATTGAGCTGTGAGCCTG 
GAGCAGATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTC 
GAACTGTGACATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGG 
AAGCCAATGACCCATTTGCCAATAAAGACGATCCCTTCTACTATGACTGGAAAAACCTGCAG 
CTGAGCGGACTGATCTGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGG 
CAAATGCAAATACAAGAGCAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCAC 
TCATCACTCCAGGCTCTGCCACTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGA 
AGCCTAACACTGGCCCCCAGCACCTCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTC 
TCTCCAAGGGCAGGCTGTTAGGCCCCTTTCTGATCAGGAGGCTTCTTTATGAATTAAACTCG 
CCCCACCACCCCCTCA 
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FIGURE 180 

MERVTIJUjLLLAGLTALEANDPFANKDDPFYYDWKNLQLSGLICGGLLAIAGIAAVLSGKCK 



YKSSQKQHS PVPEKAI PLI TPGSATTC 
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FIGURE 181 

GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATGTCGCTGCTGAGCCTGCCC 
TGGCTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTC 
CTGGCTACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCC 
AGTGTTTCCCACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCT 
ACAGAGGAGGGCTTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGT 
ATGGCTGGGTCCCATCATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCA 
CCAATGCCTCAGCTGCCATTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGG 
CTGGGAGAAGGGATACTGCTGAGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGAC 
GCCCGCCTTCCATTTCAACATCCTGAAGTCCTATATAACGATCTTCAACAAGAGTGCAAACA 
TCATGCTTGACAAGTGGCAGCACCTGGCCTCAGAGGGCAGCAGTCGTCTGGACATGTTTGAG 
CACATCAGCCTCATGACCTTGGACAGTCTACAGAAATGCATCTTCAGCTTTGACAGCCATTG 
TCAGGAGAGGCCCAGTGAATATATTGCCACCATCTTGGAGCTCAGTGCCCTTGTAGAGAAAA 
GAAGCCAGCATATCCTCCAGCACATGGACTTTCTGTATTACCTCTCCCATGACGGGCGGCGC 
TTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGACGCTGTCATCCGGGAGCGGCGTCG 
CACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAAAGCCAAGTCCAAGACTTTGG 
ATTTCATTGATGTGCTTCTGCTGAGCAAGGATGAAGATGGGAAGGCATTGTCAGATGAGGAT 
ATAAGAGCAGAGGCTGACACCTTCATGTTTGGAGGCCATGACACCACGGCCAGTGGCCTCTC 
CTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCGACAGGAGGTGC 
AAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCCAGCTGCCC 
TTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATCTCCCG 
ATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTGCC 
TCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGAC 
CCCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTC 
CGCAGGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGG 
CGTTGATGCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAA 
TTGATCATGCGCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCA 
GT^CTTTCTGACCCATCCACCTGTTTTTT 
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FIGURE 182 

MSLLSLPWLGLRPVAMSPWLLLLLWGSWL 

HLGLITPTEEGLKDSTQMSATYSQGFTVWLGP 1 1 PF I VLCHPDT IRS I TNASAAI APKDNLF 
IRFLKPWLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGS 
SRLDMFEHISLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYY 
LSHDGRRFHRACRLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDG 
KALSDED I RAEADTFMFGGHDTTASGLSWVLYNLARHPE YQERCRQE VQELLKDRDPKE I EW 
DDLAQLPFLTMCVKESLRLHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVW 
PDPEWDPFRFDPENSKGRSPLAFJPFSAGPRNCIGQAFAMAEMKVVLALMLLHFRFLPDHT 
EPRRKLELIMRAEGGLWLRVEPLNVGLQ 
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FIGURE 183 

CAACAGAAGCCAAGAAGGAAGCCGTCTATCTTGTGGCGATCATGTATAAGCTGGCCTCCTGC 
TGTTTGCTTTTCACAGGATTCTTAAATCCTCTCTTATCTCTTCCTCTCCTTGACTCCAGGGA 
AATATCCTTTCAACTCTCAGCACCTCATGAAGACGCGCGCTTAACTCCGGAGGAGCTAGAAA 
GAGCTTCCCTTCTACAGATATTGCCAGAGATGCTGGGTGCAGAAAGAGGGGATATTCTCAGG 
AAAGCAGACTGAAGTACCAACATTTTTAACCCAAGAGGAAATTTGAGAAAGTTTCAGGATTT 
CTCTGGACAAGATCCTAACATTTTACTGAGTCATCTTTTGGCCAGAATCTGGAAACCATACA 
AGAAACGTGAGACTCCTGATTGCTTCTGGAAATACTGTGTCTGAAGTGAAATAAGCATCTGT 
TAGTCAGCTCAGAAACACCCATCTTAGAATATGAAAAATAACACAATGCTTGATTTGAAAAC 
AGTGTGGAGAAAAACTAGGCAAACTACACCCTGTTCATTGTTACCTGGAAAATAAATCCTCT 
ATGTTTTGCACAAAAAAAAAAAAAAA 
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FIGURE 184 

MYKLASCCLLFTGFLNPLLSLPLLDSREISFQLSAPHEDARLTPEELERASLLQILPEMLGA 
ERGDILRKADSSTNIFNPRGNLRKFQDFSGQDPNILLSHLLARIWKPYKKRETPDCFWKYCV 
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FIGURE 185 

GAACATTTTTAGTTCCCAAGGAATGTACATCAGCCCCACGGAAGCTAGGCCACCTCTGGGAT 
GGGGTTGCTGGTTTAAAACAAACGCCAGTCATCCTATATAAGGACCTGACAGCCACCAGGCA 
CCACCTCCGCCAGGAACTGCAGGCCCACCTGTCTGCAACCCAGCTGAGGCC ATG CCCTCCCC 
AGGGACCGTCTGCAGCCTCCTGCTCCTCGGCATGCTCTGGCTGGACTTGGCCATGGCAGGCT 
CCAGCTTCCTGAGCCCTGT^ACACCAGAGAGTCCAGCAGAGAAAGGAGTCGAAGAAGCCACCA 
GCCAAGCTGCAGCCCCGAGCTCTAGCAGGCTGGCTCCGCCCGGAAGATGGAGGTCAAGCAGA 
AGGGGCAGAGGATGAACTGGAAGTCCGGTTCAACGCCCCCTTTGATGTTGGAATCAAGCTGT 
CAGGGGTTCAGTACCAGCAGCACAGCCAGGCCCTGGGGAAGTTTCTTCAGGACATCCTCTGG 
GAAGAGGCCAAAGAGGCCCCAGCCGACAAGTGATCGCCCACAAGCCTTACTCACCTCTCTCT 
AAGTTTAGAAGCGCTCATCTGGCTTTTCGCTTGCTTCTGCAGCAACTCCCACGACTGTTGTA 
CAAGCTCAGGAGGCGAATAAATGTTCAAACTGTA 
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FIGURE 186 

MPSPGTVCSLLLLGMLWLDLAMAGSSFLSPEHQRVQQRKESKKPPAKLQPRALAGWLRPEDG 
GQAEGAEDELEVRFNAPFDVGIKLSGVQYQQHSQALGKFLQDILWEEAKEAPADKO 
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JO/i\ O 
FIGURE 187 

CGGCCACAGCTGGCATGCTCTGCCTGATCGCCATCCTGCTGTATGTCCTCGTCCAGTACCTC 
GTGAACCCCGGGGTGCTCCGCACGGACCCCAGATGTCAAGAATATGAACACGTGGCTGCTGT 
TCCTCCCCCTGTTCCCGGTGCAGGTGCAGACCCTGATAGTCGTGATCATCGGGATGCTCGTG 
CTCCTGCTGGACTTT CTTGGCTTGGTGC AC CTGGGC CAGCTG CT CATC TT CCAC AT CTACCT 
GAGTATGTCCCCCACCCTAAGCCCCCGATCCCCCCAAGGCTGGGTGGTCAGAGCTGCTCATC 
TTACACCTCTACTTGAGTATGTCCCTAACCCTGAGCCCCCCACGCCTGGGGCCAGAGTCTTT 
GTCCCCCGTGTGCGCATGTGTTCAGGGTCAGCCTCTCCCAGAAGTGAGATCATGGACAAAAA 
GGGCAAATCACAGGAAGAAATTAAATCCATGAGGACCCAGCAGGCCCAGCAAGAAGCTGAAC 
TCACGCCGAGACCTGCAGGAGTGGTGCCAGGTGCTTGAAGTAACAAGTTTAAAATGTTCAGA 
GACAATGGAATGGAATCTATTAGGCAAGAACAGGACATTATGAAATAAGGACAGGTGGACTT 
CCAAAAACACAAGTAGAAATTCTAACAATGAAATATATTACAGGCAGGTCACCCACTAACCA 
AACAACTGAAGCGAGAGCTGTGGTCTTGCTTGGTCTCACAGTGGGCACAGCGGTAGGCGGTC 
AGTCATGTTGCTGAACGACGGAGGGTAAACTCCCCAGCCCCAAGAAAACCTGTGTTGGAAGT 
AACAACAACCTCCCTGCTCCTGGCACCAGCCGTTTTGGTCATGGTGGGCCAGCTGCAAAGCG 
TCTTCCATTCTCTGGGCAGTGGTGGCCCCGAGGCTGTGGCCTCTCAGGGGGTTTCTGTGGAC 
ACGGGCAGCAGAGTGTGTCCAGGCCAGCCCCCAAGAATGCCCTGCTCCTGACAGCTTGGCCA 
ACCCCTGGTCAGGGCAGAGGGAGTTGGGTGGGTCAGGCTCTGGGCTCACCTCCATCTCCAGA 
GCATCCCCTGCCTGCAGTTGTGGCAAGAACGCCCAGCTCAGAATGAACACACCCCACCAAGA 
GCCTCCTTGTTCATAACCACAGGTTACCCTACAAACCACTGTCCCCACACAACCCTGGGGAT 
GTTTTAAAACACACACCTCTAACGCATATCTTACAGTCACTGTTGTCTTGCCTGAGGGTTGA 
ATTTTTTTTAATGAAAGTGCAATGAAAATCACTGGATTAAATCCTACGGACACAGAGCTGAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 188 

MNTWLLFLPLFPVQVQTLIWIIGMLVLLLDFLGLVHLGQLLIFHIYLSMSPTLSPRSPQGW 
WRAAHLTPLLEYVPNPEPPTPGARVFVPRVRMCSGSASPRSEIMDKKGKSQEEIKSMRTQQ 
AQQEAELTPRPAGWPGA 
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FIGURE 189 

GGAGTGCAGATGGCATCCTTCGGTTCTTCCAGACAAGCTGCAAGACGCTGACCATGGCCAAG 

ATGGAGCTCTCGAAGGCCTTCTCTGGCCAGCGGACACTCCTATCTGCCATCCTCAGCATGCT 

ATCACTCAGCTTCTCCACAACATCCCTGCTCAGCAACTACTGGTTTGTGGGCACACAGAAGG 

TGCCCAAGCCCCTGTGCGAGAAAGGTCTGGCAGCCAAGTGCTTTGACATGCCAGTGTCCCTG 

GATGGAGATACCAACACATCCACCCAGGAGGTGGTACAATACAACTGGGAGACTGGGGATGA 

CCGGTTCTCCTTCCGGAGCTTCCGGAGTGGCATGTGGCTATCCTGTGAGGAAACTGTGGAAG 

AACCAGGGGAGAGGTGCCGAAGTTTCATTGAACTTACACCACCAGCCAAGAGAGGTGAGAAA 

GGACTACTGGAATTTGCCACGTTGCAAGGCCCATGTCACCCCACTCTCCGATTTGGAGGGAA 

GCGGTTGATGGAGAAGGCTTCCCTCCCCTCCCCTCCCTTGGGGCTTTGTGGCAAAAATCCTA 

TGGTTATCCCTGGGAACGCAGATCACCTACATCGGACTTCAATTCATCAGCTTCCTCCTGCT 

ACTAACAGACTTGCTACTCACTGGGAACCCTGCCTGTGGGCTCAAACTGAGCGCCTTTGCTG 

CTGTTTCCTCTGTCCTGTCAGGTCTCCTGGGGATGGTGGCCCACATGATGTATTCACAAGTC 

TTCCAAGCGACTGTCAACTTGGGTCCAGAAGACTGGAGACCACATGTTTGGAATTATGGCTG 

GGCCTTCTACATGGCCTGGCTCTCCTTCACCTGCTGCATGGCGTCGGCTGTCACCACCTTCA 

ACACGTACACCAGGATGGTGCTGGAGTTCAAGTGCAAGCATAGTAAGAGCTTCAAGGAAAAC 

CCGAACTGCCTACCACATCACCATCAGTGTTTCCCTCGGCGGCTGTCAAGTGCAGCCCCCAC 

CGTGGGTCCTTTGACCAGCTACCACCAGTATCATAATCAGCCCATCCACTCTGTCTCTGAGG 

GAGTCGACTTCTACTCCGAGCTGCGGAACAAGGGATTTCAAAGAGGGGCCAGCCAGGAGCTG 

AAAGAAGCAGTTAGGTCATCTGTAGAGGAAGAGCAGTGTTAGGAGTTAAGCGGGTTTGGGGA 

GTAGGCTTGAGCCCTACCTTACACGTCTGCTGATTATCAACATGTGCTTAAGCCAACATCCG 

TCTCTTGAGCATGGTTTTTAGAGGCTACGAATAAGGCTATGAATAAGGGTTATCTTTAAGTC 

CTAAGGGATTCCTGGGTGCCACTGCTCTCTTTTCCTCTACAGCTCCATCTTGTTTCACCCAC 

CCCACATCTCACACATCCAGAATTCCCTTCTTTACTGATAGTTTCTGTGCCAGGTTCTGGGC 

TAAACCATGGAGATAAAAAGAAGAGTAAAATACACTTCCCGACCTTAAGGATCTGAAA 
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FIGURE 190 

MAKMELSKAFSGQRTLLSAILSMLSLSFSTTSLLSNYWFVGTQKVPKPLCEKGLAAKCFDMP 
VSLDGDTNTSTQEWQYNWETGDDRFSFRSFRSGMWLSCEETVEEPGERCRSFIELTPPAKR 
GEKGLLEFATLQGPCHPTLRFGGKRLMEKASLPSPPLGLCGKNPMVIPGNADHLHRTSIHQL 
PPATNRLATHWEPCLWAQTERLCCCFLCPVRSPGDGGPHDVFTSLPSDCQLGSRRLETTCLE 
LWLGLLHGLALLHLLHGVGCHHLQHVHQDGAGVQVQA 
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FIGURE 191 

AACTGGAAGGAAAGAAAGAAAGGTCAGCTTTGGCCCAGATGTGGTTACCCCTTGGTCTCCTG 

TCTTTATGTCTTTCTCCTCTTCCTATTCTGTCATCTCCCTCACTTAAGTCTCAGGCCTGTCA 

GCAGCTCCTGTGGACATTGCCATCCCCTCTGGTAGCCTTCAGAGCAAACAGGACAACCTATG 

TTATGGATGTTTCCACCAACCAGGGTAGTGGCATGGAGCACCGTAACCATCTGTGCTTCTGT 

GATCTCTATGACAGAGCCACTTCTCCACCTCTGAAATGTTCCCTGCTCTGAAATCTGGCATG 

AGATGGCACAGGTGACCACGCAGAAGCCACCAGAATCTTGCCTGCCCTATTCCTCCTCCCAA 

GTCTGTTCTCTTATTGTCAACCTCAGCACAACAGGCTGGCGCCAATGGCATTACAGAGAAAG 

CAATCTGTGTGGCTAGTGGGCAGATTACCATGCAAGCCCCAGGAGAAATGGAGGAGCTTTGT 

AGCCACCTCCCTGTCAGCCAGTATTAACATGTCCCCTTCCCCCTGCCCCGCCGTAGATTCAG 

GACATTCGCCCCTGTGTGCCACCAAACCAGGACTTTCCCCTTGGCTTGGCATCCCTGGCTCT 

CTCCTGGTACCCAGCAAGACGTCTGTTCCAGGGCAGTGTAGCATCTTTCAAGCTCCGTTACT 

ATGGCGATGGCCATGATGTTACAATCCCACTTGCCTGAATAATCAAGTGGGAAGGGGAAGCA 

GAGGGAAATGGGGCCATGTGAATGCAGCTGCTCTGTTCTCCCTACCCTGAGGAAAAACCAAA 

GGGAAGCAACAGGAACTTCTGCAACTGGTTTTTATCGGAAAGATCATCCTGCCTGCAGATGC 

TGTTGAAGGGGCACAAGAAATGTAGCTGGAGAAGATTGATGAAAGTGCAGGTGTGTAAGGAA 

ATAGAACAGTCTGCTGGGAGTCAGACCTGGAATTCTGATTCCAAACTCTTTATTACTTTGGG 

AAGTCACTCAGCCTCCCCGTAGCCATCTCCAGGGTGACGGAACCCAGTGTATTACCTGCTGG 

AACCAAGGAAACTAACAATGTAGGTTACTAGTGAATACCCCAATGGTTTCTCCAATTATGCC 

CATGCCACCAAAACAATAAAACAAAATTCTCTAACACTGAAA 
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MWLPLGLLSLCLSPLPILSSPSLKSQACQQLLWTLPSPLVAFRANRTTYVMDVSTNQGSGME 
HRNHLCFCDLYDRATSPPLKCSLL 
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GTAGCGCGTCTTGGGTCTCCCGGCTGCCGCTGCTGCCGCCGCCGCCTCGGGTCGTGGAGCCA 
GGAGCGACGTCACCGCCATGGCAGGCATCAAAGCTTTGATTAGTTTGTCCTTTGGAGGAGCA 
ATCGGACTGATGTTTTTGATGCTTGGATGTGCCCTTCCAATATACAACAAATACTGGCCCCT 
CTTTGTTCTATTTTTTTACATCCTTTCACCTATTCCATACTGCATAGCAAGAAGATTAGTGG 
ATGATACAGATGCTATGAGTAACGCTTGTAAGGAACTTGCCATCTTTCTTACAACGGGCATT 
GTCGTGTCAGCTTTTGGACTCCCTATTGTATTTGCCAGAGCACATCTGATTGAGTGGGGAGC 
TTGTGCACTTGTTCTCACAGGAAACACAGTCATCTTTGCAACTATACTAGGCTTTTTCTTGG 
TCTTTGGAAGCAATGACGACTTCAGCTGGCAGCAGTGG TGA AAAGAAATTACTGAACTATTG 
TCAAATGGACTTCCTGTCATTTGTTGGCCATTCACGCACACAGGAGATGGGGCAGTTAATGC 
TGAATGGTATAGCAAGCCTCTTGGGGGTATTTTAGGTGCTCCCTTCTCACTTTTATTGTAAG 
CATACTATTTTCACAGAGACTTGCTGAAGGATTAAAAGGATTTTCTCTTTTGGAAAAGCTTG 
ACTGATTTCACACTTATCTATAGTATGCTTTTTGTGGTGTCCTGCTGAATTTAAATATTTAT 
GTGTTTTTCCTGTTAGGTTGATTTTTTTTGGAATCAATATGCAATGTTAAACACTTTTTTAA 
TGTAATCATTTGCATTGGTTAGGAATTCAGAATTCCGCCGGCTCTATTACTGGTCAAGTACA 
TCTTTTCTCTTAAAATTATTTAGCCTCCATTATTACAAAAAATTATAAAAATAAGTTTTCAG 
TCAGTCAGGATGACATCACTCCCAATGTTATGCAGACATACAGACGGTTGGCATACGTTATA 
GACTGTATACTCAGTGCAAATATAGCTGCATTTATACCTCAGAGGGGCCAAGTGTTAATGCC 
CATGCCCTCCGTTAAGGGTTGTTGGTTTTACTGGTAGACAGATGTTTTGTGGATTGAAAATT 
ATTTTATGGAATTGCTACAGAGGAGTGCTTTTCTTCTCAATTGTTAGAAGAATTTATGTTAA 
ACTTTAAGGTAAGGGTGTAAAAACATTTTTGAGATAAGGTTTTTATTTATGTTTATTATTGT 
TAGAGTGAGTTGCAATGTGGGAAGAAATGACATTGAAATTCCAGTTTTTGAATCCTGTTTCT 
ATTTATAAGTGAAATTTGTGATCTCCTATCAACCTTTCATGTTTTACCCTGTTAAAATGGAC 
ATACATGGAACCACTACTGATGAGGGACAGTTGTATGTTTGCATCATATATGCCAGAAAACC 
TTCCTCTGCTTCCTCCTTTTGACTTATTTGGTATGTTGTATATATTACATAAAATAACTTTT 
CAAATATAGTTTAATAACACTTAGAAGTGTTTACTTACCTGGAAAATAATTGCTATGCCGTA 
CATTCAGAGTGCCCCCTCCCCTGCAAGGCCTTGCCATGATTAACAAGTAACTTGTTAGTCTT 
ACAGATAATTCATGCATTAACAGTTTAAGATTTAGACCATGGTAATAGTAGTTCTTATTCTC 
TAAGGTTATATCATATGTAATTTAAAAGTATTTTTAAGACAAGTTTCCTGTATACCTCTGAA 
CTGTTTTGATTTTGAGTTCATCATGATAGATCTGCTGTTTCCTTATAAAAGGCATTTGTTGT 
GTGAGTTAATGCAAAGTAGCCAAGTCCAGCTATATAGCAGCTTCAGAAACATACCTGACCAA 
AAAATTCCCAGTAACCAGGCATGATCAATTTATAGTGGTCGTTTACATCTAATAATTATCAG 
GACTTTTTTCAGGAGTGGGTTATAAAAACATTCAAGTTGGTCTGACAGTATTTTGTTAAGGA 
TATTTGTTTGTATGTTTATTCAGTATACTTACATAAAAATTATTTCGCCATCAGCCAAAACT 
CAGTAATC ATGACAGCTGTCTGTTGTTTTATGAAGTTTATTT CT CAAGAAAATGGGAATAAA 
TTTGGGATTTGTTCAGCTTTTTTACTAAAGATGCCTAAAGCCACAGGTTTTATTGCCTAACT 
TAAGCCATGACTTTTAGATATGAGATGACGGGAAGCAGGACGAAATATCGGCGTGTGGCTGG 
AGCCTTCCCACTGGAGGCTGAAAGTGGCTTGTGGTATTATAATGTTCAGATTTCAAGAGGAA 
GGTGCAGGTACACATGAGTTAGAGAGCTGGTGAGACAGTTGGGAACTCTTTGTGCTTGTGAT 
CTACTGGACTTTTTTTTTGCAGGAAGTGCATTCTCTGGTCCTTCCCTATTTTCTGTTCTGGA 
TGTCAGTGCAGTGCACTGCTACTGTTTTATCCACTTGGCCACAGACTTTTTCTAACAGCTGC 
GTATTATTTCTATATACTAATTGCATTGGCAGCATTGTGTCTTTGACCTTGTATACTAGCTT 
GACATAGTGCTGTCTCTGATTTCTAGGCTAGTTACTTGAGATATGAATTTTCCATAGAATAT 
GCACTGATACAACATTACCATTCTTCTATGGAAAGAAAACTTTTGATGATGAAACAATAAAG 
ATTTTAAATATCTATTTTAAAAAAAAAA 
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FIGURE 195A 

CCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCC 
CACGCGTCCGCCCACGCGTCCGGTGCAAGCTCGCGCCGCACACTGCCTGGTGGAGGGAAGGA 
GCCCGGGCGCCTCTCGCCGCTCCCCGCGCCGCCGTCCGCACCTCCCCACCGCCCGCCGCCCG 
CCGCCCGCCGCCCGCAAAGCATGAGTGAGCCCGCTCTCTGCAGCTGCCCGGGGCGCGAATGG 
CAGGCTGTTTCCGCGGAGTAAAAGGTGGCGCCGGTCAGTGGTCGTTTCCAATGACGGACATT 
AACCAGACTGTCAGATCCTGGGGAGTCGCGAGCCCCGAGTTTGGAGTTTTTTCCCCCCACAA 
CGTCACAGTCCGAACTGCAGAGGGAAAGGAAGGCGGCAGGAAGGCGAAGCTCGGGCTCCGGC 
ACGTAGTTGGGAAACTTGCGGGTCCTAGAAGTCGCCTCCCCGCCTTGCCGGCCGCCCTTGCA 
GCCCCGAGCCGAGCAGCAAAGTGAGACATTGTGCGCCTGCCAGATCCGCCGGCCGCGGACCG 
GGGCTGCCTCGGAAACACAGAGGGGTCTTCTCTCGCCCTGCATATAATTAGCCTGCACACAA 
AGGGAGCAGCTGAATGGAGGTTGTCACTCTCTGGAAAAGGATTTCTGACCGAGCGCTTCCAA 
TGGACATTCTCCAGTCTCTCTGGAAAGATTCTCGCTA ATG GATTTCCTGCTGCTCGGTCTCT 
GTCTATACTGGCTGCTGAGGAGGCCCTCGGGGGTGGTCTTGTGTCTGCTGGGGGCCTGCTTT 
CAGATGCTGCCCGCCGCCCCCAGCGGGTGCCCGCAGCTGTGCCGGTGCGAGGGGCGGCTGCT 
GTACTGCGAGGCGCTCAACCTCACCGAGGCGCCCCACAACCTGTCCGGCCTGCTGGGCTTGT 
CCCTGCGCTACAACAGCCTCTCGGAGCTGCGCGCCGGCCAGTTCACGGGGTTAATGCAGCTC 
ACGTGGCTCTATCTGGATCACAATCACATCTGCTCCGTGCAGGGGGACGCCTTTCAGAAACT 
GCGCCGAGTTAAGGAACTCACGCTGAGTTCCAACCAGATCACCCAACTGCCCAACACCACCT 
TCCGGCCCATGCCCAACCTGCGCAGCGTGGACCTCTCGTACAACAAGCTGCAGGCGCTCGCG 
CCCGACCTCTTCCACGGGCTGCGGAAGCTCACCACGCTGCATATGCGGGCCAACGCCATCCA 
GTTTGTGCCCGTGCGCATCTTCCAGGACTGCCGCAGCCTCAAGTTTCTCGACATCGGATACA 
ATCAGCTCAAGAGTCTGGCGCGCAACTCTTTCGCCGGCTTGTTTAAGCTCACCGAGCTGCAC 
CTCGAGCACAACGACTTGGTCAAGGTGAACTTCGCCCACTTCCCGCGCCTCATCTCCCTGCA 
CTCGCTCTGCCTGCGGAGGAACAAGGTGGCCATTGTGGTCAGCTCGCTGGAGTGGGTTTGGA 
ACCTGGAGAAAATGGACTTGTCGGGCAACGAGATCGAGTACATGGAGCCCCATGTGTTCGAG 
ACCGTGCCGCACCTGCAGTCCCTGCAGCTGGACTCCAACCGCCTCACCTACATCGAGCCCCG 
GATCCTCAACTCTTGGAAGTCCCTGACAAGCATCACCCTGGCCGGGAACCTGTGGGATTGCG 
GGCGCAACGTGTGTGCCCTAGCCTCGTGGCTCAGCAACTTCCAGGGGCGCTACGATGGCAAC 
TTGCAGTGCGCCAGCCCGGAGTACGCACAGGGCGAGGACGTCCTGGACGCCGTGTACGCCTT 
CCACCTGTGCGAGGATGGGGCCGAGCCCACCAGCGGCCACCTGCTCTCGGCCGTCACCAACC 
GCAGTGATCTGGGGCCCCCTGCCAGCTCGGCCACCACGCTCGCGGACGGCGGGGAGGGGCAG 
CACGACGGCACATTCGAGCCTGCCACCGTGGCTCTTCCAGGCGGCGAGCACGCCGAGAACGC 
CGTGCAGATCCACAAGGTGGTCACGGGCACCATGGCCCTCATCTTCTCCTTCCTCATCGTGG 
TCCTGGTGCTCTACGTGTCCTGGAAGTGTTTCCCAGCCAGCCTCAGGCAGCTCAGACAGTGC 
TTTGTCACGCAGCGCAGGAAGCAAAAGCAGAAACAGACCATGCATCAGATGGCTGCCATGTC 
TGCCCAGGAATACTACGTTGATTACAAACCGAACCACATTGAGGGAGCCCTGGTGATCATCA 
ACGAGTATGGCTCGTGTACCTGCCACCAGCAGCCCGCGAGGGAATGCGAGGTGTGATTGTCC 
CAGTGGCTCTCAACCCATGCGCTACCAAATACGCCTGGGCAGCCGGGACGGGCCGGCGGGCA 
CCAGGCTGGGGTCTCCTTGTCTGTGCTCTGATATGCTCCTTGACTGAAACTTTAAGGGGATC 
TCTCCCAGAGACTTGACATTTTAGCTTTATTGTGTCTTAAAAACAAAAGCGAATTAAAACAC 
AACAAAAAACCCCACCCCACAACCTTCAGGACAGTCTATCTTAAATTTCATATGAGAACTCC 
TTCCTCCCTTTGAAGATCTGTCCATATTCAGGAATCTGAGAGTGTAAAAAAGGTGGCCATAA 
GACAGAGAGAGAATAATCGTGCTTTGTTTTATGCTACTCCTCCCACCCTGCCCATGATTAAA 
CATCATGTATGTAGAAGATCTTAAGTCCATACGCATTTCATGAAGAACCATTGGAAAGAGGA 
ATCTGCAATCTGGGAGCTTAAGAGCAAATGATGACCATAGAAAGCTATGTTCTTACTTTGTG 
TGTGTGTCTGTATGTTTCTGCGTTGTGTGTCTTTGTAGGCAAGCAAACGTTGTCTACACAAA 
CGGGAATTTAGCTCACATCATTTCATGCCCCTGTGCCTCTAGCTCTGGAGATTGGTGGGGGG 
AGGTGGGGGGAAACGGCAGGAATAAGGGAAAGTGGTAGTTTTAACTAAGGTTTTGTAACACT 
TGAAATCTTTTCTTTCTCAAATTAATTATCTTTAAGCTTCAAGAAACTTGCTCTGACCCCTC 
TAAGCAAACTACTAAGCATTTAAAAGAGAATCTAATTTTTAAAGGTGTAGCACCTTTTTTTT 
TATTCTTCCCACAGAGGGTGCTAATCTCATTATGCTGTGCTATCTGAAAAGAACTTAAGGCC 
ACAATTCACGTCTCGTCCTGGGCATTGTGATGGATTGACCCTCCATTTGCAGTACCTTCCCA 
GCTGATTAAAGTTCAGCAGTGGTATTGAGGTTTTTCGAATATTTATATAGAAAAAAAGTCTT 
TTCACATGACAAATGACACTCTCACACCAGTCTTAGCCCTAGTAGTTTTTTAGGTTGGACCA 
GAGGAAGCAGGTTAAATGAGACCTGTCCTCTGCTGCACTCAGAAAAAATAGGCAGTCCCTGA 
TGCTCAGATCTTAGCCTTGATATTAATAGTTGAGACCACCTACCCACAATGCAGCCTATACT 
CCCAAGACTACAAAGTTACCATCGCAAAGGAAAGGTTATTCCAGTAAAAGGAAATAGTTTTC 
TCAACCATTTAAAAATATTCTTCTGAACTCATCAAAGTAGAAGAGCCCCCAACCTTTTCTCT 
CTGCCTTCAAGAAGGC^GACATTTGGTATGATTTAGCATCAACAACACATTTATGAGTATAT 
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FIGURE 195B 

GTAAGTAATCAGAGGGGCAAATGCCACTTGTTATTCCTCCCAAGTTTTCCAAGCAAGTACAC 
ACAGATCTCTGGTAGGATTAGGGGCCACTTGTGTTTCCGGCTTATTTTAGTCGACTTGTCAG 
CAAGTTTGATGCCTAGTCTATCTGACATGGCCCAGTAGAACAGGGCATTGATGGATCACATG 
AGATGGTAGAAGGAACATCATCACATACCCCTCTCACAGAGAAAATTATCAAAGAACCAGAA 
ATTATATCTGTTTTGGAGCAAGAGTGTCATAATGTTTCAGGGTAGTCAAAATAAACATAAAT 
TATCTCCTCTAGATGAGTGGCGATGTTGGCTGATTTGGGTCTGCCATTGACAGAATGTCAAA 
TAAAAAGGAATTAGCTAGAATATGACCATTAAATGTGCTTCTGAAATATATTTTGAGATAGG 
TTTAGAATGTCA 
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FIGURE 196 

MDFLLLGLCLYWLLRRPSGVVLCLLGACFQMLPAAPSGCPQLCRCEGRLLYCEALNLTEAPH 
NLSGLLGLSLRYNSLSELRAGQFTGLMQLTWLYLDHNHICSVQGDAFQKLRRVKELTLSSNQ 
ITQLPNTTFRPMPNLRSVDLSYNKLQALAPDLFHGLRKLTTLHMRANAIQFVPVRIFQDCRS 
LKFLDIGYNQLKSLARNSFAGLFKLTELHLEHNDLVKVNFAHFPRLISLHSLCLRRNKVAIV 
VSSLDWVWNLEKMDLSGNEIEYMEPHVFETVPHLQSLQLDSNRLTYIEPRILNSWKSLTSIT 
LAGNLWDCGRNVCALASWLSNFQGRYDGNLQCASPEYAQGEDVLDAVYAFHLCEDGAEPTSG 
HLLSAVTNRSDLGPPASSATTLADGGEGQHDGTFEPATVALPGGEHAENAVQIHKVVTGTMA 
LIFSFLIWLVLWSWKCFPASLRQLRQCFVTQRRKQKQKQTMHQMAAMSAQEYYVDYKPNH 
IEGALVI INEYGSCTCHQQPARECEV 
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FIGURE 197 

GTGCAAGGAGCCGAGGCGAGATGGGCGTCCTGGGCCGGGTCCTGCTGTGGCTGCAGCTCTGC 
GCACTGACCCAGGCGGTCTCCAAACTCTGGGTCCCCAACACGGACTTCGACGTCGCAGCCAA 
CTGGAGCCAGAACCGGACCCCGTGCGCCGGCGGCGCCGTTGAGTTCCCGGCGGACAAGATGG 
TGTCAGTCCTGGTGCAAGAAGGTCACGCCGTCTCAGACATGCTCCTGCCGCTGGATGGGGAA 
CTCGTCCTGGCTTCAGGAGCCGGATTCGGCGTCTCAGACGTGGGCTCGCACCTGGACTGTGG 
CGCGGGCGAACCTGCCGTCTTCCGCGACTCTGACCGCTTCTCCTGGCATGACCCGCACCTGT 
GGCGCTCTGGGGACGAGGCACCTGGCCTCTTCTTCGTGGACGCCGAGCGCGTGCCCTGCCGC 
CACGACGACGTCTTCTTTCCGCCTAGTGCCTCCTTCCGCGTGGGGCTCGGCCCTGGCGCTAG 
CCCCGTGCGTGTCCGCAGCATCTCGGCTCTGGGCCGGACGTTCACGCGCGACGAGGACCTGG 
CTGTTTTCCTGGCGTCCCGCGCGGGCCGCCTACGCTTCCACGGGCCGGGCGCGCTGAGCGTG 
GGCCCCGAGGACTGCGCGGACCCGTCGGGCTGCGTCTGCGGCAACGCGGAGGCGCAGCCGTG 
GATCTGCGCGGCCCTGCTCCAGCCCCT 
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MGVLGRVLLWLQLCALTQAVSKLWPNTDFDVAANWSQNRTPCAGGAVEFPADKMVSVLVQE 
GHAVSDMLLPLDGELVLASGAGFGVSDVGSHLDCGAGEPAVFRDSDRFSWHDPHLWRSGDEA 
PGLFFVDAERVPCRHDDVFFPPSASFRVGLGPGASPVRVRS I SALGRTFTRDEDLAVFLASR 
AGRLRFHGPGALSVGPEDCADPSGCVCGNAEAQPWI CAALLQP 
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FIGURE 199 

ATCGCATCAATTGGGAGTACCATCTTCCTCATGGGACCAGTGAAACAGCTGAAGCGAATGTT 

TGAGCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTT 

CTGCCTTTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCA 

TTGACGTGGTACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGC 

CGTGTGTCTTGCATAATTCATGGCCAGTTTTATGAAGCTTTGGAAGGCACTATGGACAGAAG 

CTGGTGGACAGTTTTGTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCT 

TGCAGCAATGTGTTGCTTGTGATTCGAACATTTGAGGGTTACTTTTGGAAGCAACAATACAT 

TCTCGAACCTGAATGTCAGTAGCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAA 

TCTTCCTCATGTACCTGTTTCCTCTCTGGATGTTGTCCCACTGAATTCCCATGAATACAAAC 

CTATTCAGCAACAGCAAAAAAAAAAAAAAAAAAAAAAAAAATU^AAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAA 
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MGPVKQLKRMFEPTRLIATIMVLLCFAL^ 
FARDAVKKCFAVCLA 
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FIGURE 201 

TTGAGCGCAGGTGAGCTCCTGCGCGTTCCGGGGGCGTTCCTCCAGTCACCCTCCCGCCGTTA 
CCCGCGGCGCGCCCGAGGGAGTCTCCTCCAGACCCTCCCTCCCGTTGCTCCAAACTAATACG 
GACTGAACGGATCGCTGCGAGGGTGGGAGAGAAAATTAGGGGGAGAAAGGACAGAGAGAGCA 
ACTACCATCCATAGCCAGATAGATTATCTTACACTGAACTGATCAAGTACTTTGAAA ATGA C 
TTCGAAATTTATCTTGGTGTCCTTCATACTTGCTGCACTGAGTCTTTCAACCACCTTTTCTC 
TCCAACTAGACCAGCAAAAGGTTCTACTAGTTTCTTTTGATGGATTCCGTTGGGATTACTTA 
TATAAAGTTCCAACGCCCC^TTTTCATTATATTATGAAATATGGTGTTCACGTGAAGCAAGT 
TACTAATGTTTTTATTACAAAAACCTACCCTAACCATTATACTTTGGTAACTGGCCTCTTTG 
CAGAGAATCATGGGATTGTTGCAAATGATATGTTTGATCCTATTCGGAACAAATCTTTCTCC 
TTGGATCACATGAATATTTATGATTCCAAGTTTTGGGAAGAAGCGACACCAATATGGATCAC 
AAACCAGAGGGCAGGACATACTAGTGGTGCAGCCATGTGGCCCGGAACAGATGTAAAAATAC 
ATAAGCGCTTTCCTACTCATTACATGCCTTACAATGAGTCAGTTTCATTTGAAGATAGAGTT 
GCCAAAATTGTTGAATGGTTTACGTCAAAAGAGCCCATAAATCTTGGTCTTCTCTATTGGGA 
AGACCCTGATGACATGGGCCACCATTTGGGACCTGACAGTCCGCTCATGGGGCCTGTCATTT 
CAGATATTGACAAGAAGTTAGGATATCTCATACAAATGCTGAAAAAGGCAAAGTTGTGGAAC 
ACTCTGAACCTAATCATCACAAGTGATCATGGAATGACGCAGTGCTCTGAGGAAAGGTTAAT 
AGAACTTGACCAGTACCTGGATAAAGACCACTATACCCTGATTGATCAATCTCCAGTAGCAG 
CCATCTTGCCAAAAGAAGGTAAATTTGATGAAGTCTATGAAGCACTAACTCACGCTCATCCT 
AATCTTACTGTTTACAAAAAAGAAGACGTTCCAGAAAGGTGGCATTACAAATACAACAGTCG 
AATTCAACCAATCATAGCAGTGGCTGATGAAGGGTGGCACATTTTACAGAATAAGTCAGATG 
ACTTTCTGTTAGGCAACCACGGTTACGATAATGCGTTAGCAGATATGCATCCAATATTTTTA 
GCCCATGGTCCTGCCTTCAGAAAGAATTTCTCAAAAGAAGCCATGAACTCCACAGATTTGTA 
CCCACTACTATGCCACCTCCTCAATATCACTGCCATGCCACACAATGGATCATTCTGGAATG 
TCCAGGATCTGCTCAATTCAGCAATGCCAAGGGTGGTCCCTTATACACAGAGTACTATACTC 
CTCCCTGGTAGTGTTAAACCAGCAGAATATGACCAAGAGGGGTCATACCCTTATTTCATAGG 
GGTCTCTCTTGGCAGCATTATAGTGATTGTATTTTTTGTAATTTTCATTAAGCATTTAATTC 
ACAGTCAAATACCTGCCTTACAAGATATGCATGCTGAAATAGCTCAACCATTATTACAAGCC 
TAATGTTACTTTGAAGTGGATTTGCATATTGAAGTGGAGATTCCATAATTATGTCAGTGTTT 
AAAGGTTTCAAATTCTGGGAAACCAGTTCCAAACATCTGCAGAAACCATTAAGCAGTTACAT 
ATTTAGGTATACACACACACACACACACACACATACACACACACGGACCAAAATACTTACAC 
CTGCAAAGGAATAAAGATGTGAGAGTATGTCTCCATTGTTCACTGTAGCATAGGGATAGATA 
AGATCCTGCTTTATTTGGACTTGGCGCAGATAATGTATATATTTAGCAACTTTGCACTATGT 
AAAGTACCTTATATATTGCACTTTAAATTTCTCTCCTGATGGGTACTTTAATTTGAAATGCA 
CTTTATGGACAGTTATGTCTTATAACTTGATTGAAAATGACAACTTTTTGCACCCATGTCAC 
AGAATACTTGTTACGCATTGTTCAAACTGAAGGAAATTTCTAATAATCCCGAATAATGAACA 
TAGAAATCTATCTCCATAAATTGAGAGAAGAAGAAGGTGATAAGTGTTGAAAATTAAATGTG 
ATAACCTTTGAACCTTGAATTTTGGAGATGTATTCCCAACAGCAGAATGCAACTGTGGGCAT 
TTCTTGTCTTATTTCTTTCCAGAGAACGTGGTTTTCATTTATTTTTCCCTCAAAAGAGAGTC 
AAATACTGACAGATTCGTTCTAAATATATTGTTTCTGTCATAAAATTATTGTGATTTCCTGA 
TGAGTCATATTACTGTGATTTTCATAATAATGAAGACACCATGAATATACTTTTCTTCTATA 
TAGTTCAGCAATGGCCTGAATAGAAGCAACCAGGCACCATCTCAGCAATGTTTTCTCTTGTT 
TGTAATTATTTGCTCCTTTGAAAATTAAATCACTATTAATTACATTAAAAATCAAATTGGAT 
AAAAAAAAAAAAAAAAAAA 
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FIGURE 202 

MTSKFILVSFILAALSLSTTFSLQLDQQKVLLVSFDGFRWDYLYKVPTPHFHYIMKYGVHVK 
QVTNVF I TKT YPNHYTLVTGLF AENHG I VANDMFDP I RNKSFSLDHMN I YDS KF WEE ATP I W 
ITNQRAGHTSGAAMWPGTDVKIHKRFPTHYMPYNESVSFEDRVAKIVEWFTSKEPINLGLLY 
WEDPDDMGHHLGPDSPLMGPVISDIDKKLGYLIQMLKKAKLWNTLNLIITSDHGMTQCSEER 
LIELDQYLDKDHYTLIDQSPVAAILPKEGKFDEVYEALTHAHPNLTVYKKEDVPERWHYKYN 
SRIQPIIAVADEGWHILQNKSDDFLLGNHGYDNALADMHPIFLAHGPAFRKNFSKEAMNSTD 
LYPLLCHLLNITAMPHNGSFWNVQDLLNSAMPRWPYTQSTILLPGSVKPAEYDQEGSYPYF 
IGVSLGSIIVIVFFVIFIKHLIHSQIPALQDMHAEIAQPLLQA 
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FIGURE 203 

GGATTTTTGTGATCCGCGATTCGCTCCCACGGGCGGGACCTTTGTAACTGCGGGAGGCCCAG 
GACAGGCCCACCCTGCGGGGCGGGAGGCAGCCGGGGTGAGGGAGGTGAAGAAACCAAGACGC 
AGAGAGGCCAAGCCCCTTGCCTTGGGTCACACAGCCAAAGGAGGCAGAGCCAGAACTCACAA 
CCAGATCCAGAGGCAACAGGGACATGGCCACCTGGGACGAAAAGGCAGTCACCCGCAGGGCC 
AAGGTGGCTCCCGCTGAGAGGATGAGCAAGTTCTTAAGGCACTTCACGGTCGTGGGAGACGA 
CTACCATGCCTGGAACATCAACTACAAGAAATGGGAGAATGAAGAGGAGGAGGAGGAGGAGG 
AGCAGCCACCACCCACACCAGTCTCAGGCGAGGAAGGCAGAGCTGCAGCCCCTGACGTTGCC 
CCTGCCCCTGGCCCCGCACCCAGGGCCCCCCTTGACTTCAGGGGCATGTTGAGGAAACTGTT 
CAGCTCCCACAGGTTTCAGGTCATCATCATCTGCTTGGTGGTTCTGGATGCCCTCCTGGTGC 
TTGCTGAGCTCATCCTGGACCTGAAGATCATCCAGCCCGACAAGAATAACTATGCTGCCATG 
GTATTCCACTACATGAGCATCACCATCTTGGTCTTTTTTATGATGGAGATCATCTTTAAATT 
ATTTGTCTTCCGCCTGAGTTCTTTCACCACAAGTTTGAGATCCTGGATGCCCGTCGTGGTGG 
TGGTCTCATTCATCCTGGACATTGTCCTCCTGTTCCAGGAGCACCAGTTTGAGGCTCTGGGC 
CTGCTGATTCTGCTCCGGCTGTGGCGGGTGGCCCGGATCATCAATGGGATTATCATCTCAGT 
TAAGACACGTTCAGAACGGCAACTCTTAAGGTTAAAACAGATGAATGTACAATTGGCCGCCA 
AGATTCAACACCTTGAGTTCAGCTGCTCTGAGAAGCCCCTGGAC TGA TGAGTTTGCTGTATC 
AACCTGTAAGGAGAAGCTCTCTCCGGATGGCTATGGGAATGAAAGAATCCGACTTCTACTCT 
CACACAGCCACCGTGAAAGTCCTGGAGTAAAATGTGCTGTGTACAGAAGAGAGAGAAGGAAG 
CAGGCTGGCATGTTCACTGGGCTGGTGTTACGACAGAGAACCTGACAGTCACTGGCCAGTTA 
TCACTTGAGATTACAAATCACACAGAGCATCTGCCTGTTTTCAATCACAAGAGAACAAAACC 
AAAATCTATAAAGATATTCTGAAAATATGACAGAATTTGACAAATAAAAGCATAAACGTGTA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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MATWDEKAVTRRAKVAPAERMSKF^ 

SGEEGRAAAPDVAPAPGPAPRAPLDFRGMLRKLFSSHRFQVI 1 1 CLWLDALLVLAELILDL 
KI IQPDKNNYAAMVFH YMS I T I LVFFMME 1 1 FKLFVFRLS SFTTSLRSWMPVWWSF I LDI 
VLLFQEHQFE ALGLL I LLRL WRVARI INGI I ISVKTRSERQLLRLKQMNVQLAAKIQHLEFS 
CSEKPLD 
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CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAACAT 
GCTCTGTCTGTGCCTGTACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTG 
AGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCCATTTTCAAGCTCAGTGTCTTCATCCCCTCC 
CAGGAATTCTCCACCTACCGCCAGTGGAAGCAGAAAATTGTACAAGCTGGAGATAAGGACCT 
TGATGGGCAGCTAGACTTTGAAGAATTTGTCCATTATCTCCAAGATCATGAGAAGAAGCTGA 
GGCTGGTGTTTAAGATTTTGGACAAAAAGAATGATGGACGCATTGACGCGCAGGAGATCATG 
CAGTCCCTGCGGGACTTGGGAGTCAAGATATCTGAACAGCAGGCAGAAAAAATTCTCAAGAG 
CATGGATAAAAACGGCACGATGACCATCGACTGGAACGAGTGGAGAGACTACCACCTCCTCC 
ACCCCGTGGAAAACATCCCCGAGATCATCCTCTACTGGAAGCATTCCACGATCTTTGATGTG 
GGTGAGAATCTAACGGTCCCGGATGAGTTCACAGTGGAGGAGAGGCAGACGGGGATGTGGTG 
GAGACACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGG 
ACAGGCTCAAGGTGCTCATGCAGGTCCATGCCTCCCGCAGCAACAACATGGGCATCGTTGGT 
GGCTTCACTCAGATGATTCGAGAAGGAGGGGCCAGGTCACTCTGGCGGGGCAATGGCATCAA 
CGTCCTCAAAATTGCCCCCGAATCAGCCATCAAATTCATGGCCTATGAGCAGATCAAGCGCC 
TTGTTGGTAGTGACCAGGAGACTCTGAGGATTCACGAGAGGCTTGTGGCAGGGTCCTTGGCA 
GGGGCCATCGCCCAGAGCAGCATCTACCCAATGGAGGTCCTGAAGACCCGGATGGCGCTGCG 
GAAGACAGGCCAGTACTCAGGAATGCTGGACTGCGCCAGGAGGATCCTGGCCAGAGAGGGGG 
TGGCCGCCTTCTACAAAGGCTATGTCCCCAACATGCTGGGCATCATCCCCTATGCCGGCATC 
GACCTTGCAGTCTACGAGACGCTCAAGAATGCCTGGCTGCAGCACTATGCAGTGAACAGCGC 
GGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGG 
CCAGCTACCCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCT 
CCGGAGGTGACCATGAGCAGCCTCTTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCT 
GTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCAGCTGTGAGCATCAGCTACGTGG 
TCTACGAGAACCTGAAGATCACCCTGGGCGTGCAGTCGCGGTGACGGGGGGAGGGCCGCCCG 
GCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATG 
TGCCAACACTAAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGT 
GGGGAGAGCTGGCAGGCCCAGGGCTTGTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCC 
AGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGGCTCCGGGCTCACATGTGTAA 
GGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCTTAGT 
TCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGGCGTGC 
ATCTCCCTGTGCCCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAG 
CCCACATCCCACCCCCTCGTCCAATCCCATAATCCATGATGAAAGGTGAGGTCACGTGGCCT 
CCCAGGCCTGACTTCCCAACCTACAGCATTGACGCCAACTTGGCTGTGAAGGAAGAGGAAAG 
GATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATGCT 
TGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCAGAGAAGGCAAGTGCTGGGGCTCA 
TGGTGCTCTGAGCTGGCCTGGACCCTGTCAGGATGGGCCCCACCTCAGAACCAAACTCACTG 
TCCCCACTGTGGCATGAGGGCAGTGGAGCACCATGTTTGAGGGCGAAGGGCAGAGCGTTTGT 
GTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGGACTGTTGGGAAAAGGG 
TTTTGTCCAGAAGGACAAGCCGGACAAATGAGCGACTTCTGTGCTTCCAGAGGAAGACGAGG 
GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACC 
CCAGCAGGGGCGCAGCGGGACCAGCCCCACATTCCACTTGTGTCACTGCTTGGAACCTATTT 
ATTTTGTATTTATTTGAACAGAGTTATGTCCTAACTATTTTTATAGATTTGTTTAATTAATA 
GCTTGTCATTTTCAAGTTCATTTTTTATTCATATTTATGTTCATGGTTGATTGTACCTTCCC 
AAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACAT 
CTGTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTG 
GCTCCTTTCCTTTGGCAGGTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGA 
CTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAATAACCTTGAAGGTGGAATCCAGTTATTTC 
CTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAGTGAGGTGCCTCT 
CACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCC 
CAGCCTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCC 
ATTCCACCAGAATGACCTGATGAGGAAATCTTCAATAGGATGCAAAGATCAATGCAAAAATT 
GTTATATATGAACATATAACTGGAGTCGTCAAAAAGCAAATTAAGAAAGAATTGGACGTTAG 
AAGTTGTCATTTAAAGCAGCCTTCTAATAAAGTTGTTTCAAAGCTGAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 206 

MLCLCLYVPVIGEAQTEFQYFESKGLPAELKS IFKLSVFI PSQEFSTYRQWKQKIVQAGDKD 
LDGQLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQEIMQSLRDLGVKISEQQAEKILK 
Sl^KNGTMTIDraEWRDYHLLHPVENIPEIILYWKHSTIFDVGENLTVPDEFTVEERQTG^ 
WRHLVAGGGAGAVS RT CT APLDRLKVLMQVHASRSNNMG I VGG FTQM I REGGARSLWRGNG I 
NVLKIAPESAIKFI^YEQIKRLVGSDQETLRIHERLVAGSLAGAIAQSSIYPMEVLKTRMAL 
RKTGQ YSGMLDCARR I LAREGVAAF YKG YVPNMLG 1 1 P YAG I DLAVYETLKNAWLQH YAVNS 
ADPGVFVLLACGTMSSTCGQIiASYPLALVRTRMQAQASIEGAPEVTMSSLFKHILRTEGAFG 
LYRGLAPNFMKVIPAVSiSYWYENLKITLGVQSR 
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GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCAT 
6GCTTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAG 
CAATTGCACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTC 
GCCTCAGCTGGGAACATTGGGGAGGATGGAATCCTGAGCTGCACTTTTGAACCTGACATCAA 
ACTTTCTGATATCGTGATACAATGGCTGAAGGAAGGTGTTTTAGGCTTGGTCCATGAGTTCA 
AAGAAGGCAAAGATGAGCTGTCGGAGCAGGATGAAATGTTCAGAGGCCGGACAGCAGTGTTT 
GCTGATCAAGTGATAGTTGGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGC 
TGGCACCTACAAATGTTATATCATCACTTCTAAAGGCAAGGGGAATGCTAACCTTGAGTATA 
AAACTGGAGCCTTCAGCATGCCGGAAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTG 
CGGTGTGAGGCTCCCCGATGGTTCCCCCAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCA 
GGGAGCCAACTTCTCGGAAGTCTCCAATACCAGCTTTGAGCTGAACTCTGAGAATGTGACCA 
TGAAGGTTGTGTCTGTGCTCTACT^ATGTTACGATCAACAACACATACTCCTGTATGATTGAA 
AATGACATTGCCAAAGCAACAGGGGATATCAAAGTGACAGAATCGGAGATCAAAAGGCGGAG 
TCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGTCTCTTCTTTCTTTGCCATCAGCT 
GGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAATGTGCCTTGGCCACAAAAAAG 
CATGCAAAGTCATTGTTACAACAGGGATCTACAGAACTATTTCACCACCAGATATGACCTAG 
TTTTATATTTCTGGGAGGAAATGAATTCATATCTAGAAGTCTGGAGTGAGCAAACAAGAGCA 
AGAAACAAAAAGAAGCCAAAAGCAGAAGGCTCCAATATGAACAAGATAAATCTATCTTCAAA 
GACATATTAGAAGTTGGGAAAATAATTCATGTGAACTAGACAAGTGTGTTAAGAGTGATAAG 
TAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGTCACCT 
GGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGCTG 
TAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCA 
CAAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGG 
GGCGGCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCT 
TGGCTTCTCTTCCCAACTGACAAATGCCAAAGTTGAGAAAAATGATCATAATTTTAGCATAA 
ACAGAGCAGTCGGGGACACCGATTTTATAAATAAACTGAGCACCTTCTTTTTAAACAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 208 

MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDI 
KLSDIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTD 
AGTYKCYI ITSKGKGNANLEYKTGAFSMPEVNVDYNAS SETLRCEAPRWFPQPTWWASQVD 
QGANFSEVSNTS FELNSENVTMKWS VLYNVT INNTYS CM I END I AKATGD I KVTESE I KRR 
SHLQLLNSKASLCVSSFFAISWALLPLSPYLMLK 
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FIGURE 209 

GAATTTGTAGAAGACAGCGGCGTTGCCATGGCGGCGTCTCTGGGGCAGGTGTTGGCTCTGGT 
GCTGGTGGCCGCTCTGTGGGGTGGCACGCAGCCGCTGCTGAAGCGGGCCTCCGCCGGCCTGC 
AGCGGGTTCATGAGCCGACCTGGGCCCAGCAGTTGCTACAGGAGATGAAGACCCTCTTCTTG 
AATACTGAGTACCTGATGCCCTTTCTCCTCAACCAGTGTGGATCCCTTCTCTATTACCTCAC 
CTTGGCATCGACAGATCTGACCCTGGCTGTGCCCATCTGTAACTCTCTGGCTATCATCTTCA 
CACTGATTGTTGGGAAGGCCCTTGGAGAAGATATTGGTGGAAAACGTAAGTTAGACTACTGC 
GAGTGCGGGACGCAGCTCTGTGGATCTCGACATACCTGTGTTAGTTCCTTCCCAGAACCCAT 
CTCCCCAGAGTGGGTGAGGACACGGCCTTTTCCCATCCTGCCCTTTCCTCTGCAGCTGTTTT 
GCTTCCTTGTGGCCATCAGAGTTCCCTTCCCCTGGACAGTCTGGAGAAAGACAGAGGCTGGG 
GTTTGGGATTGAAGACCAGACCCCATCTGAGCCCTTCCTCCAGCCCTGTACCAGCTCCTACT 
GGCATGGCTGAGCTCAGACCCTCCTGATTTCTGCCTATTATCCCAGGAGCAGTTGCTGGCAT 
GGTGCTCACCGTGATAGGAATTTCACTCTGCATCACAAGCTCAGTGAGTAAGACCCAGGGGC 
AACAGTCTACCCTTTGAGTGGGCCGAACCCACTTCCAGCTCTGCTGCCTCCAGGAAGCCCCT 
GGGCCATGAAGTGCTGGCAGTGAGCGGATGGACCTAGCACTTCCCCTCTCTGGCCTTAGCTT 
CCTCCTCTCTTATGGGGATAACAGCTACCTCATGGATCACAATAAGAGAACAAGAGTGAAAG 
AGTTTTGTAACCTTCAAGTGCTGTTCAGCTGCGGGGATTTAGCACAGGAGACTCTACGCTCA 
CCCTCAGCAACCTTTCTGCCCCAGCAGCTCTCTTCCTGCTAACATCTCAGGCTCCCAGCCCA 
GCCACCATTACTGTGGCCTGATCTGGACTATCATGGTGGCAGGTTCCATGGACTGCAGAACT 
CCAGCTGCATGGAAAGGGCCAGCTGCAGACTTTGAGCCAGAAATGCAAACGGGAGGCCTCTG 
GGACTCAGTCAGAGCGCTTTGGCTGAATGAGGGGTGGAACCGAGGGAAGAAGGTGCGTCGGA 
GTGGCAGATGCAGGAAATGAGCTGTCTATTAGCCTTGCCTGCCCCACCCATGAGGTAGGCAG 
AAATCCTCACTGCCAGCCCCTCTTAAACAGGTAGAGAGCTGTGAGCCCCAGCCCCACCTGAC 
TCCAGCACACCTGGCGAGTAGTAGCTGTCAATAAATCTATGTAAACAGACAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 



WO 99/63088 , / PCT/US99/12252 

•=€^14/310 

FIGURE 210 

MAASLGQVLALVLVAALWGGTQPLLKRASAGLQRVHEPTWAQQLLQEMKTLFLNTEYLMPFL 
LNQCGSLLYYLTLASTDLTl^VPICNSLAIIFTLIVGKALGEDIGGKRKLDYCECGTQLCGS 
RHTCVSSFPEPISPEWVRTRPFPILPFPLQLFCFLVAIRVPFPWTVWRKTEAGVWD 
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FIGURE 211 

CTTCTGTAGGACAGTCACCAGGCCAGATCCAGAAGCCTCTCTAGGCTCCAGCTTTCTCTGTG 
GAAGATGACAGCAATTATAGCAGGACCCTGCCAGGCTGTCGAAAAGATTCCGCAATAAAACT 
TTGCCAGTGGGAAGTACCTAGTGAAACGGCCTAAGATGCCACTTCTTCTCATGTCCCAGGCT 
TGAGGCCCTGTGGTCCCCATCCTTGGGAGAAGTCAGCTCCAGCACCATGAAGGGCATCCTCG 
TTGCTGGTATCACTGCAGTGCTTGTTGCAGCTGTAGAATCTCTGAGCTGCGTGCAGTGTAAT 
TCATGGGAA7^AATCCTGTGTCAACAGCATTGCCTCTGAATGTCCCTCACATGCCAACACCAG 
CTGTATCAGCTCCTCAGCCAGCTCCTCTCTAGAGACACCAGTCAGATTATACCAGAATATGT 
TCTGCTCAGCGGAGAACTGCAGTGAGGAGACACACATTACAGCCTTCACTGTCCACGTGTCT 
GCTGAAGAACACTTTCATTTTGTAAGCCAGTGCTGCCAAGGAAAGGAATGCAGCAACACCAG 
CGATGCCCTGGACCCTCCCCTGAAGAACGTGTCCAGCAACGCAGAGTGCCCTGCTTGTTATG 
AATCTAATGGAACTTCCTGTCGTGGGAAGCCCTGGAAATGCTATGAAGAAGAACAGTGTGTC 
TTTCTAGTTGCAGAACTTAAGAATGACATTGAGTCTAAGAGTCTCGTGCTGAAAGGCTGTTC 
CAACGTCAGTAACGCCACCTGTCAGTTCCTGTCTGGTGAAAACAAGACTCTTGGAGGAGTCA 
TCTTTCGAAAGTTTGAGTGTGCAAATGTAAACAGCTTAACCCCCACGTCTGCACCAACCACT 
TCCCACAACGTGGGCTCCAAAGCTTCCCTCTACCTCTTGGCCCTTGCCAGCCTCCTTCTTCG 
GGGACTGCTGCCCTGAGGTCCTGGGGCTGCACTTTGCCCAGCACCCCATTTCTGCTTCTCTG 
AGGTCCAGAGCACCCCCTGCGGTGCTGACACCCTCTTTCCCTGCTCTGCCCCGTTTAACTGC 
CCAGTAAGTGGGAGTCACAGGTCTCCAGGCAATGCCGACAGCTGCCTTGTTCTTCATTATTA 
AAGGACTGGTTCATTCACTGCCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 212 

MKGILVAGITAVLVAAVESLSCVQCNSWEKSCVNSIASECPSHANTSCISSSASSSLETPVR 
LYQNMFCSAENCSEETHITAFTVHVSAEEHFHFVSQCCQGKECSNTSDALDPPLKNVSSNAE 
CPACYESNGTSCRGKPWKCYEEEQCVFLVAELKNDIESKSLVLKGCSNVSNATCQFLSGENK 
TLGGVIFRKFECANVNSLTPTSAPTTSHNVGSKASLYLLALASLLLRGLLP 
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FIGURE 213 

GGCCTCGGTTCAAACGACCCGGTGGGTCTACAGCGGAAGGGAGGGAGCGAAGGTAGGAGGCA 

GGGCTTGCCTCACTGGCCACCCTCCCAACCCCAAGAGCCCAGCCCCATGGTCCCCGCCGCCG 

GCGCGCTGCTGTGGGTCCTGCTGCTGAATCTGGGTCCCCGGGCGGCGGGGGCCCAAGGCCTG 

ACCCAGACTCCGACCGAAATGCAGCGGGTCAGTTTACGCTTTGGGGGCCCCATGACCCGCAG 

CTACCGGAGCACCGCCCGGACTGGTCTTCCCCGGAAGACAAGGATAATCCTAGAGGACGAGA 

ATGATGCCATGGCCGACGCCGACCGCCTGGCTGGACCAGCGGCTGCCGAGCTCTTGGCCGCC 

ACGGTGTCCACCGGCTTTAGCCGGTCGTCCGCCATTAACGAGGAGGATGGGTCTTCAGAAGA 

GGGGGTTGTGATTAATGCCGGAAAGGATAGCACCAGCAGAGAGCTTCCCAGTGCGACTCCCA 

ATACAGCGGGGAGTTCCAGCACGAGGTTTATAGCCAATAGTCAGGAGCCTGAAATCAGGCTG 

ACTTCAAGCCTGCCGCGCTCCCCCGGGAGGTCTACTGAGGACCTGCCAGGCTCGCAGGCCAC 

CCTGAGCCAGTGGTCCACACCTGGGTCTACCCCGAGCCGGTGGCCGTCACCCTCACCCACAG 

CCATGCCATCTCCTGAGGATCTGCGGCTGGTGCTGATGCCCTGGGGCCCGTGGCACTGCCAC 

TGCAAGTCGGGCACCATGAGCCGGAGCCGGTCTGGGAAGCTGCACGGCCTTTCCGGGCGCCT 

TCGAGTTGGGGCGCTGAGCCAGCTCCGCACGGAGCACAAGCCTTGCACCTATCAACAATGTC 

CCTGCAACCGACTTCGGGAAGAGTGCCCCCTGGACACAAGTCTCTGTACTGACACCAACTGT 

GCCTCTCAGAGCACCACCAGTACCAGGACCACCACTACCCCCTTCCCCACCATCCACCTCAG 

AAGCAGTCCCAGCCTGCCACCCGCCAGCCCCTGCCCAGCCCTGGCTTTTTGGAAACGGGTCA 

GGATTGGCCTGGAGGATATTTGGAATAGCCTCTCTTCAGTGTTCACAGAGATGCAACCAATA 

GACAGAAACCAGAGGTAATGGCCACTTCATCCACATGAGGAGATGTCAGTATCTCAACCTCT 

CTTGCCCTTTCAATCCTAGCACCCACTAGATATTTTTAGTACAGAAAAACAAAACTGGAAAA 

CACAA 
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FIGURE 214 

MVPAAGALLWVLLLNLGPRAAGAQGLTQTPTEMQRVSLRFGGPMTRSYRSTARTGLPRKTRI 
ILEDENDAMADADRLAGPAAAELLAATVSTGFSRSSAINEEDGSSEEGWINAGKDSTSREL 
PSATPNTAGSSSTRFIANSQEPEIRLTSSLPRSPGRSTEDLPGSQATLSQWSTPGSTPSRWP 
SPSPTAMPSPEDLRLVLMPWGPWHCHCKSGTMSRSRSGKLHGLSGRLRVGALSQLRTEHKPC 
TYQQCPCNRLREECPLDTSLCTDTNCASQSTTSTRTTTTPFPTIHLRSSPSLPPASPCPALA 
FWKR VR I GLED I WNS LS S VFTEMQ P I DRNQR 
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CCCGGGTCGACCCACGCGTCCGGGGAGAAAGGATGGCCGGCCTGGCGGCGCGGTTGGTCCTG 
CTAGCTGGGGCAGCGGCGCTGGCGAGCGGCTCCCAGGGCGACCGTGAGCCGGTGTACCGCGA 
CTGCGTACTGCAGTGCGAAGAGCAGAACTGCTCTGGGGGCGCTCTGAATCACTTCCGCTCCC 
GCCAGCCAATCTACATGAGTCTAGCAGGCTGGACCTGTCGGGACGACTGTAAGTATGAGTGT 
ATGTGGGTCACCGTTGGGCTCTACCTCCAGGAAGGTCACAAAGTGCCTCAGTTCCATGGCAA 
GTGGCCCTTCTCCCGGTTCCTGTTCTTTCAAGAGCCGGCATCGGCCGTGGCCTCGTTTCTCA 
ATGGCCTGGCCAGCCTGGTGATGCTCTGCCGCTACCGCACCTTCGTGCCAGCCTCCTCCCCC 
ATGTACCACACCTGTGTGGCCTTCGCCTGGGTGTCCCTCAATGCATGGTTCTGGTCCACAGT 
CTTCCACACCAGGGACACTGACCTCACAGAGAAAATGGACTACTTCTGTGCCTCCACTGTCA 
TCCTACACTCAATCTACCTGTGCTGCGTCAGGACCGTGGGGCTGCAGCACCCAGCTGTGGTC 
AGTGCCTTCCGGGCTCTCCTGCTGCTCATGCTGACCGTGCACGTCTCCTACCTGAGCCTCAT 
CCGCTTCGACTATGGCTACAACCTGGTGGCCAACGTGGCTATTGGCCTGGTCAACGTGGTGT 
GGTGGCTGGCCTGGTGCCTGTGGAACCAGCGGCGGCTGCCTCACGTGCGCAAGTGCGTGGTG 
GTGGTCTTGCTGCTGCAGGGGCTGTCCCTGCTCGAGCTGCTTGACTTCCCACCGCTCTTCTG 
GGTCCTGGATGCCCATGCCATCTGGCACATCAGCACCATCCCTGTCCACGTCCTCTTTTTCA 
GCTTTCTGGAAGATGACAGCCTGTACCTGCTGAAGGAATCAGAGGACAAGTTCAAGCTGGAC 
TGAAGACCTTGGAGCGAGTCTGCCCCAGTGGGGATCCTGCCCCCGCCCTGCTGGCCTCCCTT 
CTCCCCTCAACCCTTGAGATGATTTTCTCTTTTCAACTTCTTGAACTTGGACATGAAGGATG 
TGGGCCCAGAATCATGTGGCCAGCCCACCCCCTGTTGGCCCTCACCAGCCTTGGAGTCTGTT 
CTAGGGAAGGCCTCCCAGCATCTGGGACTCGAGAGTGGGCAGCCCCTCTACCTCCTGGAGCT 
GAACTGGGGTGGAACTGAGTGTGTTCTTAGCTCTACCGGGAGGACAGCTGCCTGTTTCCTCC 
CCACCAGCCTCCTCCCCACATCCCCAGCTGCCTGGCTGGGTCCTGAAGCCCTCTGTCTACCT 
GGGAGACCAGGGACCACAGGCCTTAGGGATACAGGGGGTCCCCTTCTGTTACCACCCCCCAC 
CCTCCTCCAGGACACCACTAGGTGGTGCTGGATGCTTGTTCTTTGGCCAGCCAAGGTTCACG 
GCGATTCTCCCCATGGGATCTTGAGGGACCAAGCTGCTGGGATTGGGAAGGAGTTTCACCCT 
GACCGTTGCCCTAGCCAGGTTCCCAGGAGGCCTCACCATACTCCCTTTCAGGGCCAGGGCTC 
CAGCAAGCCCAGGGCAAGGATCCTGTGCTGCTGTCTGGTTGAGAGCCTGCCACCGTGTGTCG 
GGAGTGTGGGCCAGGCTGAGTGCATAGGTGACAGGGCCGTGAGCATGGGCCTGGGTGTGTGT 
GAGCTCAGGCCTAGGTGCGCAGTGTGGAGACGGGTGTTGTCGGGGAAGAGGTGTGGCTTCAA 
AGTGTGTGTGTGCAGGGGGTGGGTGTGTTAGCGTGGGTTAGGGGAACGTGTGTGCGCGTGCT 
GGTGGGCATGTGAGATGAGTGACTGCCGGTGAATGTGTCCACAGTTGAGAGGTTGGAGCAGG 
ATGAGGGAATCCTGTCACCATCAATAATCACTTGTGGAGCGCCAGCTCTGCCCAAGACGCCA 
CCTGGGCGGACAGCCAGGAGCTCTCCATGGCCAGGCTGCCTGTGTGCATGTTCCCTGTCTGG 
TGCCCCTTTGCCCGCCTCCTGCAAACCTCACAGGGTCCCCACACAACAGTGCCCTCCAGAAG 
CAGCCCCTCGGAGGCAGAGGAAGGAAAATGGGGATGGCTGGGGCTCTCTCCATCCTCCTTTT 
CTCCTTGCCTTCGCATGGCTGGCCTTCCCCTCCAAAACCTCCATTCCCCTGCTGCCAGCCCC 
TTTGCCATAGCCTGATTTTGGGGAGGAGGAAGGGGCGATTTGAGGGAGAAGGGGAGAAAGCT 
TATGGCTGGGTCTGGTTTCTTCCCTTCCCAGAGGGTCTTACTGTTCCAGGGTGGCCCCAGGG 
CAGGCAGGGGCCACACTATGCCTGTGCCCTGGTAAAGGTGACCCCTGCCATTTACCAGCAGC 
CCTGGCATGTTCCTGCCCCACAGGAATAGAATGGAGGGAGCTCCAGAAACTTTCCATCCCAA 
AGGCAGTCTCCGTGGTTGAAGCAGACTGGATTTTTGCTCTGCCCCTGACCCCTTGTCCCTCT 
TTGAGGGAGGGGAGCTATGCTAGGACTCCAACCTCAGGGACTCGGGTGGCCTGCGCTAGCTT 
CTTTTGATACTGAAAACTTTTAAGGTGGGAGGGTGGCAAGGGATGTGCTTAATAAATCAATT 
CG^GCCTCAAAAAAAAAAAAAAAAA 
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FIGURE 216 

I^GLAARLVLLAGAAAIASGSQGDREPVYRDCVLQCEEQNCSGGALNHFR 
TCRDDCKYECMWVTVGLYLQEGHKVPQFH^ 

YRTFVPASSPMYHTCVAFAWVSLNAWFWSTWHTRDTDLTEKMDYFCASTVILHSIYLCCVR 
WGLQHPAWSAFRALLLLMLTVHVSYLSLIRFDYGYNLV^ 

RLPHVRKC WWLLLQGLSLLE LLDF PPLF WVLDAHAI WH I S T I PVHVLFFS FLEDDSLYLL 
KESEDKFKLD 
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FIGURE 217 

GGCCGCCTGGAATTGTGGGAGTTGTGTCTGCCACTCGGCTGCCGGAGGCCGAAGGTCCGTGA 

CTATGGCTCCCCAGAGCCTGCCTTCATCTAGGATGGCTCCTCTGGGCATGCTGCTTGGGCTG 

CTGATGGCCGCCTGCTTCACCTTCTGCCTCAGTCATCAGAACCTGAAGGAGTTTGCCCTGAC 

CAACCCAGAGAAGAGCAGCACCAAAGAAACGGAGAGAAAAGAAACCAAAGCCGAGGAGGAGC 

TGGATGCCGAAGTCCTGGAGGTGTTCCACCCGACGCATGAGTGGCAGGCCCTTCAGCCAGGG 

CAGGCTGTCCCTGCAGGATCCCACGTACGGCTGAATCTTCAGACTGGGGAAAGAGAGGCAAA 

ACTCCAATATGAGGACAAGTTCCGAAATAATTTGAAAGGCAAAAGGCTGGATATCAACACCA 

ACACCTACACATCTCAGGATCTCAAGAGTGCACTGGCAAAATTCAAGGAGGGGGCAGAGATG 

GAGAGTTCAAAGGAAGACAAGGCAAGGCAGGCTGAGGTAAAGCGGCTCTTCCGCCCCATTGA 

GGAACTGAAGAAAGACTTTGATGAGCTGAATGTTGTCATTGAGACTGACATGCAGATCATGG 

TACGGCTGATCAACAAGTTCAATAGTTCCAGCTCCAGTTTGGAAGAGAAGATTGCTGCGCTC 

TTTGATCTTGAATATTATGTCCATCAGATGGACAATGCGCAGGACCTGCTTTCCTTTGGTGG 

TCTTCAAGTGGTGATCAATGGGCTGAACAGCACAGAGCCCCTCGTGAAGGAGTATGCTGCGT 

TTGTGCTGGGCGCTGCCTTTTCCAGCAACCCCAAGGTCCAGGTGGAGGCCATCGAAGGGGGA 

GCCCTGCAGAAGCTGCTGGTCATCCTGGCCACGGAGCAGCCGCTCACTGCAAAGAAGAAGGT 

CCTGTTTGCACTGTGCTCCCTGCTGCGCCACTTCCCCTATGCCCAGCGGCAGTTCCTGAAGC 

TCGGGGGGCTGCAGGTCCTGAGGACCCTGGTGCAGGAGAAGGGCACGGAGGTGCTCGCCGTG 

CGCGTGGTCACACTGCTCTACGACCTGGTCACGGAGAAGATGTTCGCCGAGGAGGAGGCTGA 

GCTGACCCAGGAGATGTCCCCAGAGAAGCTGCAGCAGTATCGCCAGGTACACCTCCTGCCAG 

GCCTGTGGGAACAGGGCTGGTGCGAGATCACGGCCCACCTCCTGGCGCTGCCCGAGCATGAT 

GCCCGTGAGAAGGTGCTGCAGACACTGGGCGTCCTCCTGACCACCTGCCGGGACCGCTACCG 

TCAGGACCCCCAGCTCGGCAGGACACTGGCCAGCCTGCAGGCTGAGTACCAGGTGCTGGCCA 

GCCTGGAGCTGCAGGATGGTGAGGACGAGGGCTACTTCCAGGAGCTGCTGGGCTCTGTCAAC 

AGCTTGCTGAAGGAGCTGAGATGAGGCCCCACACCAGGACTGGACTGGGATGCCGCTAGTGA 

GGCTGAGGGGTGCCAGCGTGGGTGGGCTTCTCAGGCAGGAGGACATCTTGGCAGTGCTGGCT 

TGGCCATTAAATGGAAACCTGAAGGCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
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FIGURE 218 

MAPQSLPSSRMAPLGMLLGLLMAACFTFCLSHQNLKEFALTNPEKSSTKETERKETKAEEEL 
DAEVLEVFHPTHEWQALQPGQAVPAGSHVRLNLQTGEREAKLQYEDKFRNNLKGKRLDINTN 
TYTSQDLKSALAKFKEGAEMESSKEDKARQAEVKRLFRPIEELKKDFDELNWIETDMQIMV 
RLINKFNSSSSSLEEKIAALFDLEYYVHQMDNAQDLLSFGGLQWINGLNSTEPLVKEYAAF 
VLGAAFSSNPKVQVEAIEGGALQKLLVILATEQPLTAKKKVLFALCSLLRHFPYAQRQFLKL 
GGLQVLRTLVQEKGTEVLAVRVVTLLYDLVTEKMFAEEEAELTQEMSPEKLQQYRQVHLLPG 
LWEQGWCEITAHLLALPEHDAREKVLQTLGVLLTTCRDRYRQDPQLGRTLASLQAEYQVLAS 
LELQDGEDEGYFQELLGSVNSLLKELR 
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FIGURE 219 

TTCGGCTTCCGTAGAGGAAGTGGCGCGGACCTTCATTTGGGGTTTCGGTTCCCCCCCTTCCC 

CTTCCCCGGGGTCTGGGGGTGACATTGCACCGCGCCCCTCGTGGGGTCGCGTTGCCACCCCA 

CGCGGACTCCCCAGCTGGCGCGCCCCTCCCATTTGCCTGTCCTGGTCAGGCCCCCACCCCCC 

TTCCCACCTGACCAGCCATGGGGGCTGCGGTGTTTTTCGGCTGCACTTTCGTCGCGTTCGGC 

CCGGCCTTCGCGCTTTTCTTGATCACTGTGGCTGGGGACCCGCTTCGCGTTATCATCCTGGT 

CGCAGGGGCATTTTTCTGGCTGGTCTCCCTGCTCCTGGCCTCTGTGGTCTGGTTCATCTTGG 

TCCATGTGACCGACCGGTCAGATGCCCGGCTCCAGTACGGCCTCCTGATTTTTGGTGCTGCT 

GTCTCTGTCCTTCTACAGGAGGTGTTCCGCTTTGCCTACTACAAGCTGCTTAAGAAGGCAGA 

TGAAGGGTTAGCATCGCTGAGTGAGGACGGAAGATCACCCATCTCCATCCGCCAGATGGCCT 

ATGTTTCTGGTCTCTCCTTCGGTATCATCAGTGGTGTCTTCTCTGTTATCAATATTTTGGCT 

GATGCACTTGGGCCAGGTGTGGTTGGGATCCATGGAGACTCACCCTATTACTTCCTGACTTC 

AGCCTTTCTGACAGCAGCCATTATCCTGCTCCATACCTTTTGGGGAGTTGTGTTCTTTGATG 

CCTGTGAGAGGAGACGGTACTGGGCTTTGGGCCTGGTGGTTGGGAGTCACCTACTGACATCG 

GGACTGACATTCCTGAACCCCTGGTATGAGGCCAGCCTGCTGCCCATCTATGCAGTCACTGT 

TTCCATGGGGCTCTGGGCCTTCATCACAGCTGGAGGGTCCCTCCGAAGTATTCAGCGCAGCC 

TCTTGTGTAAGGACTGACTACCTGGACTGATCGCCTGACAGATCCCACCTGCCTGTCCACTG 

CCCATGACTGAGCCCAGCCCCAGCCCGGGTCCATTGCCCACATTCTCTGTCTCCTTCTCGTC 

GGTCTACCCCACTACCTCCAGGGTTTTGCTTTGTCCTTTTGTGACCGTTAGTCTCTAAGCTT 

TACCAGGAGCAGCCTGGGTTCAGCCAGTCAGTGACTGGTGGGTTTGAATCTGCACTTATCCC 

CACCACCTGGGGACCCCCTTGTTGTGTCCAGGACTCCCCCTGTGTCAGTGCTCTGCTCTCAC 

CCTGCCCAAGACTCACCTCCCTTCCCCTCTGCAGGCCGACGGCAGGAGGACAGTCGGGTGAT 

GGTGTATTCTGCCCTGCGCATCCCACCCGAGGACTGAGGGAACCTAGGGGGGACCCCTGGGC 

CTGGGGTGCCCTCCTGATGTCCTCGCCCTGTATTTCTCCATCTCCAGTTCTGGACAGTGCAG 

GTTGCCAAGAAAAGGGACCTAGTTTAGCCATTGCCCTGGAGATGAAATTAATGGAGGCTCAA 

GGATAGATGAGCTCTGAGTTTCTCAGTACTCCCTCAAGACTGGACATCTTGGTCTTTTTCTC 

AGGCCTGAGGGGGAACCATTTTTGGTGTGATAAATACCCTAAACTGCCTTTTTTTCTTTTTT 

GAGGTGGGGGGAGGGAGGAGGTATATTGGAACTCTTCTAACCTCCTTGGGCTATATTTTCTC 

TCCTCGAGTTGCTCCTCATGGCTGGGCTCATTTCGGTCCCTTTCTCCTTGGTCCCAGACCTT 

GGGGGAAAGGAAGGAAGTGCATGTTTGGGAACTGGCATTACTGGAACTAATGGTTTTAACCT 

CCTTAACCACCAGCATCCCTCCTCTCCCCAAGGTGAAGTGGAGGGTGCTGTGGTGAGCTGGC 

GACTCCAGAGCTGCAGTGCCACTGGAGGAGTCAGACTACCATGACATCGTAGGGAAGGAGGG 

GAGATTTTTTTGTAGTTTTTAATTGGGGTGTGGGAGGGGCGGGGAGGTTTTCTATAAACTGT 

ATCATTTTCTGCTGAGGGTGGAGTGTCCCATCCTTTTAATCAAGGTGATTGTGATTTTGACT 

AATAAAAAAGAATTTGTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAJ^^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 220 

MGAAVFFGCTFVAFGPAFALFL I TVAGD PLRV 1 1 LVAGAF FWLVSLLLAS WWF I LVHVTDR 
SDARLQYGLL I FGAAVSVLLQE VFRFAYYKLLKKADEGLASLSEDGRS P I S I RQMAYVSGLS 
FGI ISGVFSVINILADALGPGWGIHGDSPYYFLTSAFLTAAI ILLHTFWGWFFDACERRR 
YWALGLWGSHLLTSGLTFLNPWYEASLLPIYAVTVSMGLWAFITAGGSLRSIQRSLLCKD 
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FIGURE 221 

AAGCTGGTTTAAGGAAGCAGAGGAGGGTTAGATTCGTTGAGTGAGGACGGAAGATCAACCCA 
TTTCCATTCCGCCAGATGGCCTATGTTTCTGGTCTCTCCCTTCGGNATCATCAGTGGTGTNT 
TNTCTGTTATCAATATTTTGGCTGATGCANTTGGGCCAGGTGTGGTTGGGATCCATGGAGAC 
TCACCCTATTANTTCCTGANTTCAGCCTTTNTGACAGCAGCCATTATCCTGCTC 
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FIGURE 222 

GACCGACCGTTCAGATGCCCGGTTCCAGTACGGCTTCCTGATTTTTGGTGCTGCTGTNTCTG 
TCCTTCTACAGGAGGTGTTCCGCTTTGCCTANTACAAGCTGCTTAAGAAGGCAGATGAGGGG 
TTAGCATNGCTGAGTGAGGACGGAAGATCACCCATTTCCATCCGCCAGATGGCCTATGTTTN 
TGGTNTTTCCTTCGGTATCATCAGTGGTGTTTTNTCTGTTATCAATATTTTGGNTGATGCAN 
TTGGGCCAGGTGTGGTTGGGATCCATGGAGANTCACCCTATTAATTCCTGAATTCAGCCTTT 
NTGACAGCAGCCATTATCCTGNTCCATACCTTTTGGGGAGTTGTGTTTTTTGATGCCTGTGA 
GAGGAG 
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FIGURE 223 

NGTTGGAGAAGTGGCGCGGACNTTCATTTGGGGTTTCGGTTTCCCCCCTTTCCCTTTCCCCG 
GGGTCTGGGGTGACATTGCACGGGCCCCTCGTGGGGTCGCGTTGCCACCCCACGCGGACTCC 
CCAGNTGGNGCGCCCTTCCCATTTGCCTGTCCTGGTCAGGCCCCCACCCCCCTTCCCACNTG 
ACCAGCCATGGGGGCTGCGGTGTTTTTCGGCTGCACTTTCGTCGCGTTCGGCCCGGCCTTCG 
CGCTTTTCTTGATCACTGTGGCTGGGGACCCGCTTCGCGTTATCATCCTGGTCGCAGGGGCA 
TTTTTCTGGCTGGTCTCCCTGCTCCTGGCCTCTGTGGTCTGGTTCATCTTGGTCCATGTGAC 
CGACCGGTCAGATGCCCGGCTCCAGTACGGCCTCCTGATTTTTGGTGCTGCTGTCTCTGTCC 
TTCTACAGGAGGTGTTCCGCTTTGCCTACTACAAGCTGCTTAAGAAGGCAGATGAGGGGTTA 
GCATCGCTGAGTGAGGACGGAAGATCACCCATCTCCATCCGCCAGATGGCCTATGTTTCTGG 
TCTCTCCTTCGGTATCATCAGTGGTGTCTTCTCTGTTATCAATATTTTGGCTGATGCACTTG 
GGCCAGGTGTGGTTGGGATCCATGGAGACTCACCC 
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FIGURE 224 

GTAAAAGAAAGTGGCCGGACCTTCATTGGGGTTTCGGTTCCCCCCTTTCCCNTTCCCCGGGG 
TCTGGGGGTGACATTGCACCGCGCCCNTCGTGGGGTCGCGTTGCCACCCCACGCGGACTCCC 
CAGNTGGCGCGCCCCTCCCATTTGCCTGTCCTGGTCAGGCCCCCACCCCCCTTCCCACCTGA 
CCAGCCATGGGGGCTGCGGTGTTTTTCGGGCTGCACTTTCGTCGCGTTCGGGCCCGGCCTTC 
GCGCTTTTCTTGATCACTGTGGCTGGGGACCCGCTTCGCGTTATCATCCTGGTCGCAGGGGC 
ATTTTTCTGGCTGGTCTCCCTGCTCCTGGCCTCTGTGGTCTGGTTCATCTTGGTCCATGTGA 
CCGACCGGTCAGATGCCCGGCTCCAGTACGGCCTCCTGATTTTTGGTGCTGCTGTCTCTGTC 
CTTCTACAGGAGGTGTTCCGCTTTGCCTACTACAAGCTGCTTAAGAAGGCAGATGAGGGGTT 
AGCATCGCTGAGTGAGGACGGAAGATCACCCATCTCCATCCGCCAGATGGCCTATGTTTCTG 
GTCTCTCCTTCGGTATCATCAGTGGTGTCTTCTCTGTTATCAATATTTTGGCTGATGCACTT 
GGGCCAGGTGTGGTTGGGATCCATGGAGAC 
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FIGURE 225 

GCCCCAGGGAGCAGTGGGTGGTTATAACTCAGGCCCGGTGCCCAGAGCCCAGGAGGAGGCAG 
TGGCCAGGAAGGCACAGGCCTGAGAAGTCTGCGGCTGAGCTGGGAGCAAATCCCCCACCCCC 
TACCTGGGGGACAGGGCAAGTGAGACCTGGTGAGGGTGGCTCAGCAGGCAGGGAAGGAGAGG 
TGTCTGTGCGTCCTGCACCCACATCTTTCTCTGTCCCCTCCTTGCCCTGTCTGGAGGCTGCT 
AGACTCCTATCTTCTGAATTCTATAGTGCCTGGGTCTCAGCGCAGTGCCGATGGTGGCCCGT 
CCTTGTGGTTCCTCTCTACCTGGGGAAATAAGGTGCAGCGGCCATGGCTACAGCAAGACCCC 
CCTGGATGTGGGTGCTCTGTGCTCTGATCACAGCCTTGCTTCTGGGGGTCACAGAGCATGTT 
CTCGCCAACAATGATGTTTCCTGTGACCACCCCTCTAACACCGTGCCCTCTGGGAGCAACCA 
GGACCTGGGAGCTGGGGCCGGGGAAGACGCCCGGTCGGATGACAGCAGCAGCCGCATCATCA 
ATGGATCCGACTGCGATATGCACACCCAGCCGTGGCAGGCCGCGCTGTTGCTAAGGCCCAAC 
CAGCTCTACTGCGGGGCGGTGTTGGTGCATCCACAGTGGCTGCTCACGGCCGCCCACTGCAG 
GAAGAAAGTTTTCAGAGTCCGTCTCGGCCACTACTCCCTGTCACCAGTTTATGAATCTGGGC 
AGCAGATGTTCCAGGGGGTCAAATCCATCCCCCACCCTGGCTACTCCCACCCTGGCCACTCT 
AACGACCTCATGCTCATCAAACTGAACAGAAGAATTCGTCCCACTAAAGATGTCAGACCCAT 
CAACGTCTCCTCTCATTGTCCCTCTGCTGGGACAAAGTGCTTGGTGTCTGGCTGGGGGACAA 
CCAAGAGCCCCCAAGTGCACTTCCCTAAGGTCCTCCAGTGCTTGAATATCAGCGTGCTAAG'T 
CAGAAAAGGTGCGAGGATGCTTACCCGAGACAGATAGATGACACCATGTTCTGCGCCGGTGA 
CAAAGCAGGTAGAGACTCCTGCCAGGGTGATTCTGGGGGGCCTGTGGTCTGCAATGGCTCCC 
TGCAGGGACTCGTGTCCTGGGGAGATTACCCTTGTGCCCGGCCCAACAGACCGGGTGTCTAC 
ACGAACCTCTGCAAGTTCACCAAGTGGATCCAGGAAACCATCCAGGCCAACTCCTGAGTCAT 
CCCAGGACTCAGCACACCGGCATCCCCACCTGCTGCAGGGACAGCCCTGACACTCCTTTCAG 
ACCCTCATTCCTTCCCAGAGATGTTGAGAATGTTCATCTCTCCAGCCCCTGACCCCATGTCT 
CCTGGACTCAGGGTCTGCTTCCCCCACATTGGGCTGACCGTGTCTCTCTAGTTGAACCCTGG 
GAACAATTTCCAAAACTGTCCAGGGCGGGGGTTGCGTCTCAATCTCCCTGGGGCACTTTCAT 
CCTCAAGCTCAGGGCCCATCCCTTCTCTGCAGCTCTGACCCAAATTTAGTCCCAGAAATAAA 
CTGAGAAGTGGAAAAAAAAA 
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FIGURE 226 

MATARPPWMWVL 

SSSRIINGSDCDMHTQPWQAALLLRPNQLYCGAVLVHPQWLLTAAHCRKKVFRVRLGHYSLS 
PVYESGQQMFQGVKSIPHPGYSHPGHSNDLMLIKLNRRIRPTKDVRPINVSSHCPSAGTKCL 
VSGWGTTKSPQVHFPKVLQCLNISVLSQKRCEDAYPRQIDDTMFCAGDKAGRDSCQGDSGGP 
WCNGSLQGLVSWGDYPCARPNRPGVYTNLCKFTKWIQETIQANS 
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FIGURE 227 



ATGGTCAACGACCGGTGGAAGACCATGGGCGGCGCTGCCCAACTTGAGGACCGGCCGCGCGA 

CAAGCCGCAGCGGCCGAGCTGCGGCTACGTGCTGTGCACCGTGCTGCTGGCCCTGGCTGTGC 

TGCTGGCTGTAGCTGTCACCGGTGCCGTGCTCTTCCTGAACCACGCCCACGCGCCGGGCACG 

GCGCCCCCACCTGTCGTCAGCACTGGGGCTGCCAGCGCCAACAGCGCCCTGGTCACTGTGGA 

AAGGGCGGACAGCTCGCACCTCAGCATCCTCATTGACCCGCGCTGCCCCGACCTCACCGACA 

GCTTCGCACGCCTGGAGAGCGCCCAGGCCTCGGTGCTGCAGGCGCTGACAGAGCACCAGGCC 

CAGCCACGGCTGGTGGGCGACCAGGAGCAGGAGCTGCTGGACACGCTGGCCGACCAGCTGCC 

CCGGCTGCTGGCCCGAGCCTCAGAGCTGCAGACGGAGTGCATGGGGCTGCGGAAGGGGCATG 

GCACGCTGGGCCAGGGCCTCAGCGCCCTGCAGAGTGAGCAGGGCCGCCTCATCCAGCTTCTC 

TCTGAGAGCCAGGGCCACATGGCTGACCTGGTGAACTCCGTCAGCGACATCCTGGATGCCCT 

GCAGAGGGACCGGGGGCTGGGCCGGCCCCGCAACAAGGCCGACCTTCAGAGAGCGCCTGCCC 

GGGGAACCCGGCCCCGGGGCTGTGCCACTGGCTCCCGGCCCCGAGACTGTCTGGACGTCCTC 

CTAAGCGGACAGCAGGACGATGGCGTCTACTCTGTCTTTCCCACCCACTACCCGGCCGGCTT 

CCAGGTGTACTGTGACATGCGCACGGACGGCGGCGGCTGGACGGTGTTTCAGCGCCGGGAGG 

ACGGCTCCGTGAACTTCTTCCGGGGCTGGGACGCGTACCGAGACGGCTTTGGCAGGCTCACC 

GGGGAGCACTGGCTAGGGCTCAAGAGGATCCACGCCCTGACCACACAGGCTGCCTACGAGCT 

GCACGTGGACCTGGAGGACTTTGAGAATGGCACGGCCTATGCCCGCTACGGGAGCTTCGGCG 

TGGGCTTGTTCTCCGTGGACCCTGAGGAAGACGGGTACCCGCTCACCGTGGCTGACTATTCC 

GGCACTGCAGGCGACTCCCTCCTGAAGCACAGCGGCATGAGGTTCACCACCAAGGACCGTGA 

CAGCGACCATTCAGAGAACAACTGTGCCGCCTTCTACCGCGGTGCCTGGTGGTACCGCAACT 

GCCACACGTCCAACCTCAATGGGCAGTACCTGCGCGGTGCGCACGCCTCCTATGCCGACGGC 

GTGGAGTGGTCCTCCTGGACCGGCTGGCAGTACTCACTCAAGTTCTCTGAGATGAAGATCCG 

GCCGGTCCGGGAGGACCGCTAGACTGGTGCACCTTGTCCTTGGCCCTGCTGGTCCCTGTCGC 

CCCATCCCCGACCCCACCTCACTCTTTCGTGAATGTTCTCCACCCACCTGTGCCTGGCGGAC 

CCACTCTCCAGTAGGGAGGGGCCGGGCCATCCCTGACACGAAGCTCCCTGGGCCGGTGAAGT 

CACACATCGCCTTCTCGCCGTCCCCACCCCCTCCATTTGGCAGCTCACTGATCTCTTGCCTC 

TGCTGATGGGGGCTGGCAAACTTGACGACCCCAACTCCTGCCTGCCCCCACTGTGACTCCGG 

TGCTGTTTGCCGTCCCCTGGCCAGGATGGTGGAGTCTGCCCCAGGCACCCTCTGCCCTGCCC 

GGCCAAATACCCGGCATTATGGGGACAGAGAGCAGGGGGCAGACAGCACCCCTGGAGTCCTC 

CTAGCAGATCGTGGGGAATGTCAGGTCTCTCTGAGGTCAGGTCTGAGGCCAGTATCCTCCAG 

CCCTCCCAATGCCAACCCCCACCCCGTTTCCCTGGTGCCCAGAGAACCCACCTCTCCCCCAA 

GGGCCTCAGCCTGGCTGTGGGCTGGGTGGCCCCATCCTACCAGGCCCTGAGGTCAGGATGGG 

GAGCTGCTGCCTTTGGGGACCCACGCTCCAAGGCTGAGACCAGTTCCCTGGAGGCCACCCAC 

CCTGTGCCCCGGCAGGCCTGGGGTCTGCAGTCCTCTTACCTGCTGTGCCCACCTGCTCTCTG 

TCTCAAATGAGGCCCAACCCATCCCCCACCCAGCTCCCGGCCGTCCTCCTACCTGGGGCAGC 

CGGGGCTGCCATCCCATTTCTCCTGCCTCTGGAAGGTGGGTGGGGCCCTGCACCGTGGGGCT 

GGACTGCGCTAATGGGAAGCTCTTGGTTTTCTGGGCTGGGGCCTAGGCAGGGCTGGGATGAG 

GCTTGTACAACCCCCACCACCAATTTCCCAGGGACTCCAGGGTCCTGAGGCCTCCCAGGAGG 

GCCTTGGGGGTGATGACCCCTTCCCTGAGGTGGCTGTCTCCATGAGGAGGCCAACCCTTGCC 

ATTGACCGTGGCCACCTGGACCCAGGCCAGGCCCGGCCCGGCGAGTGGTCAAGGGACAGGGA 

CCACCTCACCGGGCAAATGGGGTCGGGGGGACTGGGGCACCAGACCAGGCACCACCTGGACA 

CTTTCTTGTTGAATCCTCCCAACACCCAGCACGCTGTCATCCCCACTCCTTGTGTGCACACA 

TGCAGAGGTGAGACCCGCAGGCTCCCAGGACCAGCAGCCACAAGGGCAGGGCTGGAGCCGGG 

TCCTCAGCTGTCTGCTCAGCAGCCCTGGACCCGCGTGCGTTACGTCAGGCCCAGATGCAGGG 

CGGCTTTTCCAAGGCCTCCTGATGGGGGCCTCCGAAAGGGCTGGAGTCAGCCTTGGGGAGCT 

GCCTAGCAGCCTCTCCTCGGGCAGGAGGGGAGGTGGCTTCCTCCAAAGGACACCCGATGGCA 

GGTGCCTAGGGGGTGTGGGGTTCCGTTCTCCCTTCCCCTCCCACTGAAGTTTGTGCTTAAAA 

AACAATAAATTTGACTTGGCACCACTGGGGGTTGGTGGGAGAGGCCGTGTGACCTGGCTCTC 

TGTCCCAGTGCCACCAGGTCATCCACATGCGCAG 
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FIGURE 228 

MVNDRWKTMGGAAQLEDRPRDKPQRPSCGYVLCTVLIJ^ 

APPPWSTGAASANSALVTVERADSSHLSILIDPRCPDLTDSFARLESAQASVLQALTEHQA 
QPRLVGDQEQELLDTLADQLPRLLARASELQTECMGLRKGHGTLGQGLSALQSEQGRLIQLL 
SESQGHMAHLWSVSDILDALQRDRGLGRPRNKADLQRAPARGTRPRGCATGSRPRDCLDVL 
LSGQQDDGVYSVFPTHYPAGFQVYCDMRTDGGGWTVFQRREDGSVNFFRGWDAYRDGFGRLT 
GEHWLGLKRIHALTTQAAYELHVDLEDFENGTAYARYGSFGVGLFSVDPEEDGYPLTVADYS 
GTAGDSLLKHSGMRFTTKDRDSDHSENNCAAFYRGAWWYRNCHTSNLNGQYLRGAHASYADG 
VEWSSWTGWQYSLKFSEMKIRPVREDR 
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FIGURE 229 

GCAGTCAGAGACTTCCCCTGCCCCTCGCTGGGAAAGAACATTAGGAATGCCTTTTAGTGCCT 
TGCTTCCTGAACTAGCTCACAGTAGCCCGGCGGCCCAGGGCAATCCGACCACATTTCACTCT 
CACCGCTGTAGGAATCCAGATGCAGGCCAAGTACAGCAGCACGAGGGACATGCTGGATGATG 
ATGGGGACACCACCATGAGCCTGCATTCTCAAGCCTCTGCCACAACTCGGCATCCAGAGCCC 
CGGCGCACAGAGCACAGGGCTCCCTCTTCAACGTGGCGACCAGTGGCCCTGACCCTGCTGAC 
TTTGTGCTTGGTGCTGCTGATAGGGCTGGCAGCCCTGGGGCTTTTGTTTTTTCAGTACTACC 
AGCTCTCCAATACTGGTCAAGACACCATTTCTCAAATGGAAGAAAGATTAGGAAATACGTCC 
CAAGAGTTGCAATCTCTTCAAGTCCAGAATATAAAGCTTGCAGGAAGTCTGCAGCATGTGGC 
TGAAAAACTCTGTCGTGAGCTGTATAACAAAGCTGGAGCACACAGGTGCAGCCCTTGTACAG 
AACAATGGAAATGGCATGGAGACAATTGCTACCAGTTCTATAAAGACAGCTUyyVGTTGGGAG 
GACTGTAAATATTTCTGCCTTAGTGAAAACTCTACCATGCTGAAGATAAACAAACAAGAAGA 
CCTGGAATTTGCCGCGTCTCAGAGCTACTCTGAGTTTTTCTACTCTTATTGGACAGGGCTTT 
TGCGCCCTGACAGTGGCAAGGCCTGGCTGTGGATGGATGGAACCCCTTTCACTTCTGAACTG 
TTCCATATTATAATAGATGTCACCAGCCCAAGAAGCAGAGACTGTGTGGCCATCCTCAATGG 
GATGATCTTCTCAAAGGACTGCAAAGAATTGAAGCGTTGTGTCTGTGAGAGAAGGGCAGGAA 
TGGTGAAGCCAGAGAGCCTCCATGTCCCCCCTGAAACATTAGGGGAAGGTGACTGATTCGCC 
CTCTGCAACTACAAATAGCAGAGTGAGCCAGGCGGTGCCAAAGCAAGGGCTAGTTGAGACAT 
TGGGAAATGGAACATAATCAGGAAAGACTATCTCTCTGACTAGTACAAAATGGGTTCTCGTG 
TTTCCTGTTCAGGATCACCAGCATTTCTGAGCTTGGGTTTATGCACGTATTTAACAGTCACA 
AGAAGTCTTATTTACATGCCACCAACCAACCTCAGAAACCCATAATGTCATCTGCCTTCTTG 
GCTTAGAGATAACTTTTAGCTCTCTTTCTTCTCAATGTCTAATATCACCTCCCTGTTTTCAT 
GTCTTCCTTACACTTGGTGGAATAAGAAACTTTTTGAAGTAGAGGAAATACATTGAGGTAAC 
ATCCTTTTCTCTGACAGTCAAGTAGTCCATCAGAAATTGGCAGTCACTTCCCAGATTGTACC 
AGCAAATACACAAGGAATTCTTTTTGTTTGTTTCAGTTCATACTAGTCCCTTCCCAATCCAT 
CAGTAAAGACCCCATCTGCCTTGTCCATGCGGTTTCCCAACAGGGATGTCACTTGATATGAG 
AATCTCAAATCTCAATGCCTTATAAGCATTCCTTCCTGTGTCCATTAAGACTCTGATAATTG 
TCTCCCCTCCATAGGAATTTCTCCCAGGAAAGAAATATATCCCCATCTCCGTTTCATATCAG 
AACTACCGTCCCCGATATTCCCTTCAGAGAGATTAAAGACCAGAAAAAAGTGAGCCTCTTCA 
TCTGCACCTGTAATAGTTTCAGTTCCTATTTTCTTCCATTGACCCATATTTATACCTTTCAG 
GTACTGAAGATTTAATAATAATAAATGTAAATACTGTGAAAAA 
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FIGURE 230 

MQAKYSSTRDMLDDDGDTTMSLHSQASATTRHPEPRRTEHRAPSSTWRPVALTLLTLCLVLL 
IGLAALGLLFFQYYQLSNTGQDTISQMEERLGNTSQELQSLQVQNIKLAGSLQHVAEKLCRE 
LYNKAGAHRCSPCTEQWKWHGDNCYQFYKDSKSWEDCKYFCLSENSTMLKINKQEDLEFAAS 
QSYSEFFYSYWTGLLRPDSGKAWLWMDGTPFTSELFHIIIDVTSPRSRDCVAILNGMIFSKD 
CKELKRCVCERRAGMVKPESLHVPPETLGEGD 
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FIGURE 231 

AATTTTCACCGCTGTAGGAATCCAGATGCAGGCCAAGTACAGCAGCACGAGGGACATGNTGG 
ATGATGATGGGACACCACCATGAGCCTGCATTNTCAAGCTTTTGCCACAATTCGGCATCCAG 
AGCCCCGGCGCACAGAGCACAGGGNTCCTTTTTCAACGTGGCGACCAGTGGCCCTGACCCTG 
CTGACTTTGTGCTTGGTGCTGCTGATAGGGCTGGCAGCCCTGGGGCTTTTGTTTTTTCAGTA 
CTACCAGCTCTCCAATACTGGTCAAGACACCATTTCTCAAATGGAAGAAAGATTAGGAAATA 
CGTCCCAAGAGTTGCAATTTNTTCAAGTCCAGAATATAAAGCTTGCAGGAAGTNTGCAGCAT 
GTGGCTGAAAAACTCTGTCGTGAGCTGTATAACAAAGCTGGAGGAACTTTGAAGGAGGGCAA 
AGTNTCCTCATNTACTATACACACACCACTTCCC 
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FIGURE 232 

GCCGAGCGCAAGAACCCTGCGCAGCCCAGAGCAGCTGCTGGAGGGGAATCGAGGCGCGGCTC 
CGGGGATTCGGCTCGGGCCGCTGGCTCTGCTCTGCGGGGAGGGAGCGGGCCCGCCCGCGGGG 
CCCGAGCCCTCCGGATCCGCCCCCTCCCCGGTCCCGCCCCCTCGGAGACTCCTCTGGCTGCT 
CTGGGGGTTCGCCGGGGCCGGGGACCCGCGGTCCGGGCGCCATGCGGGCATCGCTGCTGCTG 
TCGGTGCTGCGGCCCGCAGGGCCCGTGGCCGTGGGCATCTCCCTGGGCTTCACCCTGAGCCT 
GCTCAGCGTCACCTGGGTGGAGGAGCCGTGCGGCCCAGGCCCGCCCCAACCTGGAGACTCTG 
AGCTGCCGCCGCGCGGCAACACCAACGCGGCGCGCCGGCCCAACTCGGTGCAGCCCGGAGCG 
GAGCGCGAGAAGCCCGGGGCCGGCGAAGGCGCCGGGGAGAATTGGGAGCCGCGCGTCTTGCC 
CTACCACCCTGCACAGCCCGGCCAGGCCGCCAAAAAGGCCGTCAGGACCCGCTACATCAGCA 
CGGAGCTGGGCATCAGGCAGAGGCTGCTGGTGGCGGTGCTGACCTCTCAGACCACGCTGCCC 
ACGCTGGGCGTGGCCGTGAACCGCACGCTGGGGCACCGGCTGGAGCGTGTGGTGTTCCTGAC 
GGGCGCACGGGGCCGCCGGGCCCCACCTGGCATGGCAGTGGTGACGCTGGGCGAGGAGCGAC 
CCATTGGACACCTGCACCTGGCGCTGCGCCACCTGCTGGAGCAGCACGGCGACGACTTTGAC 
TGGTTCTTCCTGGTGCCTGACACCACCTACACCGAGGCGCACGGCCTGGCACGCCTAACTGG 
CCACCTCAGCCTGGCCTCCGCCGCCCACCTGTACCTGGGCCGGCCCCAGGACTTCATCGGCG 
GAGAGCCCACCCCCGGCCGCTACTGCCACGGAGGCTTTGGGGTGCTGCTGTCGCGCATGCTG 
CTGCAACAACTGCGCCCCCACCTGGAAGGCTGCCGCAACGACATCGTCAGTGCGCGCCCTGA 
CGAGTGGCTGGGTCGCTGCATTCTCGATGCCACCGGGGTGGGCTGCACTGGTGACCACGAGG 
GGGTGCACTATAGCCATCTGGAGCTGAGCCCTGGGGAGCCAGTGCAGGAGGGGGACCCTCAT 
TTCCGAAGTGCCCTGACAGCCCACCCTGTGCGTGACCCTGTGCACATGTACCAGCTGCACAA 
AGCTTTCGCCCGAGCTGAACTGGAACGCACGTACCAGGAGATCCAGGAGTTACAGTGGGAGA 
TCCAGAATACCAGCCATCTGGCCGTTGATGGGGACCGGGCAGCTGCTTGGCCCGTGGGTATT 
CCAGCACCATCCCGCCCGGCCTCCCGCTTTGAGGTGCTGCGCTGGGACTACTTCACGGAGCA 
GCACGCTTTCTCCTGCGCCGATGGCTCACCCCGCTGCCCACTGCGTGGGGCTGACCGGGCTG 
ATGTGGCCGATGTTCTGGGGACAGCTCTAGAGGAGCTGAACCGCCGCTACCACCCGGCCTTG 
CGGCTCCAGAAGCAGCAGCTGGTGAATGGCTACCGACGCTTTGATCCGGCCCGGGGTATGGA 
ATACACGCTGGACTTGCAGCTGGAGGCACTGACCCCCCAGGGAGGCCGCCGGCCCCTCACTC 
GCCGAGTGCAGCTGCTCCGGCCGCTGAGCCGCGTGGAGATCTTGCCTGTGCCCTATGTCACT 
GAGGCCTCACGTCTCACTGTGCTGCTGCCTCTAGCTGCGGCTGAGCGTGACCTGGCCCCTGG 
CTTCTTGGAGGCCTTTGCCACTGCAGCACTGGAGCCTGGTGATGCTGCGGCAGCCCTGACCC 
TGCTGCTACTGTATGAGCCGCGCCAGGCCCAGCGCGTGGCCCATGCAGATGTCTTCGCACCT 
GTCAAGGCCCACGTGGCAGAGCTGGAGCGGCGTTTCCCCGGTGCCCGGGTGCCATGGCTCAG 
TGTGCAGACAGCCGCACCCTCACCACTGCGCCTCATGGATCTACTCTCCAAGAAGCACCCGC 
TGGACACACTGTTCCTGCTGGCCGGGCCAGACACGGTGCTCACGCCTGACTTCCTGAACCGC 
TGCCGCATGCATGCCATCTCCGGCTGGCAGGCCTTCTTTCCCATGCATTTCCAAGCCTTCCA 
CCCAGGTGTGGCCCCACCACAAGGGCCTGGGCCCCCAGAGCTGGGCCGTGACACTGGCCGCT 
TTGATCGCCAGGCAGCCAGCGAGGCCTGCTTCTACAACTCCGACTACGTGGCAGCCCGTGGG 
CGCCTGGCGGCAGCCTCAGAACAAGAAGAGGAGCTGCTGGAGAGCCTGGATGTGTACGAGCT 
GTTCCTCCACTTCTCCAGTCTGCATGTGCTGCGGGCGGTGGAGCCGGCGCTGCTGCAGCGCT 
ACCGGGCCCAGACGTGCAGCGCGAGGCTCAGTGAGGACCTGTACCACCGCTGCCTCCAGAGC 
GTGCTTGAGGGCCTCGGCTCCCGAACCCAGCTGGCCATGCTACTCTTTGAACAGGAGCAGGG 
CAACAGCACCTGACCCCACCCTGTCCCCGTGGGCCGTGGCATGGCCACACCCCACCCCACTT 
CTCCCCCAAAACCAGAGCCACCTGCCAGCCTCGCTGGGCAGGGCTGGCCGTAGCCAGACCCC 
AAGCTGGCCCACTGGTCCCCTCTCTGGCTCTGTGGGTCCCTGGGCTCTGGACAAGCACTGGG 
GGACGTGCCCCCAGAGCCACCCACTTCTCATCCCAAACCCAGTTTCCCTGCCCCCTGACGCT 
GCTGATTCGGGCTGTGGCCTCCACGTATTTATGCAGTACAGTCTGCCTGACGCCAGCCCTGC 
CTCTGGGCCCTGGGGGCTGGGCTGTAGAAGAGTTGTTGGGGAAGGAGGGAGCTGAGGAGGGG 
GCATCTCCCAACTTCTCCCTTTTGGACCCTGCCGAAGCTCCCTGCCTTTAATAAACTGGCCA 
AGTGTGGAAAAA 
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FIGURE 233 

MRASLLLSVLRPAGPVAVGISLGFTLSLLSVTWVEEPCGPGPPQPGDSELPPRGNTNAARRP 
NSVQPGAEREKPGAGEGAGENWEPRVLP YHPAQPGQAAKKAVRTRYI STELG IRQRLLVAVL 
TSQTTLPTLGVAVNRTLGHRLERWFLTGARGRRAPPGMAWTLGEERPIGHLHLALRHLLE 
QHGDDFDWFFLVPDTTYTEAHGLARLTGHLSLASAAHLYLGRPQDFIGGEPTPGRYCHGGFG 
VLLSRMLLQQLRPHLEGCRNDIVSARPDEWLGRCILDATGVGCTGDHEGVHYSHLELSPGEP 
VQEGDPHFRS ALTAHPVRDP VHMYQLHKAFARAELERTYQE IQELQWE IQNTSHLAVDGDRA 
AAWPVGIPAPSRPASRFEVLRWDYFTEQHAFSCADGSPRCPLRGADRADVADVLGTALEELN 
RRYHPALRLQKQQLVNGYRRFDPARGMEYTLDLQLEALTPQGGRRPLTRRVQLLRPLSRVEI 
LPVPWTEASRLTVLLPLAAAERDLAPGFLEAFATAALEPGDAAAALTLLLLYEPRQAQRV^ 
HADVFAPVKAHVAELERRFPGARVPWLSVQTAAPSPLRLMDLLSKKHPLDTLFLLAGPDTVL 
TPDFLNRCRMHAISGWQAFFPMHFQAFHPGVAPPQGPGPPELGRDTGRFDRQAASEACFYNS 
DYVAARGRLAAASEQEEELLESLDVYELFLHFSSLHVLRAVEPALLQRYRAQTCSARLSEDL 
YHRCLQSVLEGLGSRTQLAMLLFEQEQGNST 
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GCTCTGGCCGGCCCCGGCGATTGGTCACCGCCCGCTAGGGGACAGCCCTGGCCTCCTCTGAT 
TGGCAAGCGCTGGCCACCTCCCCACACCCCTTGCGAACGCTCCCCTAGTGGAGAAAAGGAGT 
AGCTATTAGCCAATTCGGCAGGGCCCGCTTTTTAGAAGCTTGATTTCCTTTGAAGATGAAAG 
ACTAGCGGAAGCTCTGCCTCTTTCCCCAGTGGGCGAGGGAACTCGGGGCGATTGGCTGGGAA 
CTGTATCCACCCAAATGTCACCGATTTCTTCCTATGCAGGAAATGAGCAGACCCATCAATAA 
GAAATTTCTCAGCCTGGCCGAAAATGGTTGGCCCCACGAAGCCACGACAACTGGAGGCAAAG 
AGGGTTGCTCAACGCCCCGCCTCATTGGAAAACCAAATCAGATCTGGGACCTATATAGCGTG 
GCGGAGGCGGGGCGATGATTGTCGCGCTCGCACCCACTGCAGCTGCGCACAGTCGCATTTCT 
TTCCCCGCCCCTGAGACCCTGCAGCACCATCTGTCATGGCGGCTGGGCTGTTTGGTTTGAGC 
GCTCGCCGTCTTTTGGCGGCAGCGGCGACGCGAGGGCTCCCGGCCGCCCGCGTCCGCTGGGA 
ATCTAGCTTCTCCAGGACTGTGGTCGCCCCGTCCGCTGTGGCGGGAAAGCGGCCCCCAGAAC 
CGACCACACCGTGGCAAGAGGACCCAGAACCCGAGGACGAAAACTTGTATGAGAAGAACCCA 
GACTCCCATGGTTATGACAAGGACGCCGTTTTGGACGTCTGGAACATGCGACTTGTCTTCTT 
CTTTGGCGTCTCCATCATCCTGGTCCTTGGCAGCACCTTTGTGGCCTATCTGCCTGACTACA 
GGATGAAAGAGTGGTCCCGCCGCGAAGCTGAGAGGCTTGTGAAATACCGAGAGGCCAATGGC 
CTTCCCATCATGGAATCCAACTGCTTCGACCCCAGCAAGATCCAGCTGCCAGAGGATGAGTG 
ACCAGTTGCTAAGTGGGGCTCAAGAAGCACCGCCTTCCCCACCCCCTGCCTGCCATTCTGAC 
CTCTTCTCAGAGCACCTAATTAAAGGGGCTGAAAGTCTGAA 
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FIGURE 235 

MAAGLFGLSARRLLAAAATRGLPAARVRWESSFSRTVVAPSAVAGKRPPEPTTPWQEDPEPE 
DENL YE KN PD SHG YDKD P VLD VWNMRLVF F FG VS I ILVLGSTFVAYLPDYRMKEWSRREAER 
LVKYREANGLPIMESNCFDPSKIQLPEDE 
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GGCGGCTGGGCTGTTTGGTTTGAGCGCTCGCCGTCTTTTGGCGGCAGCGGCGACGCGAGGGC 
TCCCGGCCGCCCGCGTCCGCTGGGAATCTAGCTTCTCCAGGACTGTGGTCGCCCCGTCCGCT 
GTGGCGGGAAAGCGGCCCCCAGAACCGACCACACCGTGGCAAGAGGACCCAGAACCCGAGGA 
CGAAAACTTGTATGAGAAGAACCCAGACTCCCATGGTTATGACAAGGACCCCGTTTTGGACG 
TCTGGAACATGCGACTTGTCTTCTTCTTTGGCGTCTCCATCATCCTGGTCCTTGGCAGCACC 
TTTGTGGCCTATCTGCCTGACTACAGGATGAAAGAGTGGTCCCGCCGCGAAGCTGAGAGGCT 
TGTGAAATACCGAGAGGCCAATGGCCTTCCCATCATGGAATCCAACTGCTTCGACCCCAGCA 
AGATCCAG 
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FIGURE 237 

GCGGCGGCTATGCCGCTTGCTCTGCTCGTCCTGTTGCTCCTGGGGCCCGGCGGCTGGTGCCT 
TGCAGAACCCCCACGCGACAGCCTGCGGGAGGAACTTGTCATCACCCCGCTGCCTTCCGGGG 
ACGTAGCCGCCACATTCCAGTTCCGCACGCGCTGGGATTCGGAGCTTCAGCGGGAAGGAGTG 
TCCCATTACAGGCTCTTTCCCAAAGCCCTGGGGCAGCTGATCTCCAAGTATTCTCTACGGGA 
GCTGCACCTGTCATTCACACAAGGCTTTTGGAGGACCCGATACTGGGGGCCACCCTTCCTGC 
AGGCCCCATCAGGTGCAGAGCTGTGGGTCTGGTTCCAAGACACTGTCACTGATGTGGATAAA 
TCTTGGAAGGAGCTCAGTAATGTCCTCTCAGGGATCTTCTGCGCCTCTCTCAACTTCATCGA 
CTCCACCAACACAGTCACTCCCACTGCCTCCTTCAAACCCCTGGGTCTGGCCAATGACACTG 
ACCACTACTTTCTGCGCTATGCTGTGCTGCCGCGGGAGGTGGTCTGCACCGAAAACCTCACC 
CCCTGGAAGAAGCTCTTGCCCTGTAGTTCCAAGGCAGGCCTCTCTGTGCTGCTGAAGGCAGA 
TCGCTTGTTCCACACCAGCTACCACTCCCAGGCAGTGCATATCCGCCCTGTTTGCAGAAATG 
CACGCTGTACTAGCATCTCCTGGGAGCTGAGGCAGACCCTGTCAGTTGTATTTGATGCCTTC 
ATCACGGGGCAGGGAAAGAAAGACTGGTCCCTCTTCCGGATGTTCTCCCGAACCCTCACGGA 
GCCCTGCCCCCTGGCTTCAGAGAGCCGAGTCTATGTGGACATCACCACCTACAACCAGGACA 
ACGAGACATTAGAGGTGCACCCACCCCCGACCACTACATATCAGGACGTCATCCTAGGCACT 
CGGAAGACCTATGCCATCTATGACTTGCTTGACACCGCCATGATCAACAACTCTCGAAACCT 
CAACATCCAGCTCAAGTGGAAGAGACCCCCAGAGAATGAGGCCCCCCCAGTGCCCTTCCTGC 
ATGCCCAGCGGTACGTGAGTGGCTATGGGCTGCAGAAGGGGGAGCTGAGCACACTGCTGTAC 
AACACCCACCCATACCGGGCCTTCCCGGTGCTGCTGCTGGACACCGTACCCTGGTATCTGCG 
GCTGTATGTGCACACCCTCACCATCACCTCCAAGGGCAAGGAGAACAAACCAAGTTACATCC 
ACTACCAGCCTGCCCAGGACCGGCTGCAACCCCACCTCCTGGAGATGCTGATTCAGCTGCCG 
GCCAACTCAGTCACCAAGGTTTCCATCCAGTTTGAGCGGGCGCTGCTGAAGTGGACCGAGTA 
CACGCCAGATCCTAACCATGGCTTCTATGTCAGCCCATCTGTCCTCAGCGCCCTTGTGCCCA 
GCATGGTAGCAGCCAAGCCAGTGGACTGGGAAGAGAGTCCCCTCTTCAACAGCCTGTTCCCA 
GTCTCTGATGGCTCTAACTACTTTGTGCGGCTCTACACGGAGCCGCTGCTGGTGAACCTGCC 
GACACCGGACTTCAGCATGCCCTACAACGTGATCTGCCTCACGTGCACTGTGGTGGCCGTGT 
GCTACGGCTCCTTCTACAATCTCCTCACCCGAACCTTCCACATCGAGGAGCCCCGCACAGGT 
GGCCTGGCCAAGCGGCTGGCCAACCTTATCCGGCGCGCCCGAGGTGTCCCCCCACTCTGATT 
CTTGCCCTTTCCAGCAGCTGCAGCTGCCGTTTCTCTCTGGGGAGGGGAGCCCAAGGGCTGTT 
TCTGCCACTTGCTCTCCTCAGAGTTGGCTTTTGAACCAAAGTGCCCTGGACCAGGTCAGGGC 
CTACAGCTGTGTTGTCCAGTACAGGAGCCACGAGCCAAATGTGGCATTTGAATTTGAATTAA 
CTTAGAAATTCATTTCCTCACCTGTAGTGGCCACCTCTATATTGAGGTGCTCAATAAGCAAA 
AGTGGTCGGTGGCTGCTGTATTGGACAGCACAGAAAAAGATTTCCATCACCACAGAAAGGTC 
GGCTGGCAGCACTGGCCAAGGTGATGGGGTGTGCTACACAGTGTATGTCACTGTGTAGTGGA 
TGGAGTTTACTGTTTGTGGAATAAAAACGGCTGTTTCCGTGGAAAAAAAAAAT^A 
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FIGURE 238 

MPLALLVLLLLGPGGWCLAEPPRDSLREELVITPLPSGDVAATFQFRTRWDSELQREGVSHY 
RLFPKALGQLISKYSLRELHLSFTQGFWRTRYWGPPFLQAPSGAELWVWFQDTVTDVDKSWK 
ELSNVLSGIFCASLNFIDSTNTVTPTASFKPLGLANDTDHYFLRYAVLPREWCTENLTPWK 
KLLPCSSKAGLSVLLKADRLFHTSYHSQAVHIRPVCRNARCTSISWELRQTLSVVFDAFITG 
QGKKDWSLFRMFSRTLTEPCPLASESRVYVDITTYNQDNETLEVHPPPTTTYQDVILGTRKT 
YAIYDLLDTAMINNSRNLNIQLKWKRPPENEAPPVPFLHAQRYVSGYGLQKGELSTLLYNTH 
PYRAFPVLLLDTVPWYLRLYVHTLTITSKGKENKPSYIHYQPAQDRLQPHLLEMLIQLPANS 
VTKVS I QFERALLKWTEYTPDPNHGF YVS PS VLS ALVPSMVAAKPVDWEE S PLFNS LF P VS D 
GSNYFVRLYTEPLLVNLPTPDFSMPYNVICLTCTWAVCYGSFYNLLTRTFHIEEPRTGGLA 
KRLANL I RRARG VP PL 
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FIGURE 239 

CAACATGGGGTCCAGCAGCTTCTTGGTCCTCATGGTGTCTCTCGTTCTTGTGACCCTGGTGG 
CTGTGGAAGGAGTTAAAGAGGGTATAGAGAAAGCAGGGGTTTGCCCAGCTGACAACGTACGC 
TGCTTCAAGTCCGATCCTCCCCAGTGTCACACAGACCAGGACTGTCTGGGGGAAAGGAAGTG 
TTGTTACCTGCACTGTGGCTTCAAGTGTGTGATTCCTGTGAAGGAACTGGAAGAAGGAGGAA 
ACAAGGATGAAGATGTGTCAAGGCCATACCCTGAGCCAGGATGGGAGGCCAAGTGTCCAGGC 
TCCTCCTCTACCAGGTGTCCTCAGAAATGATGCTGGGTCCTTTCTACCTCTGGGGGTCACTC 
TCACTTGGCACCTGCCCCTGAGGGTCCTGAGACTTGGAATATGGAAGAAGCAATACCCAACC 
CCACCAAAGAAAACCTGAGCTTGAAGTCCTTTTCCCCAAAAAGAGGGAAGAGTCACAAAAAG 
TCCAGACCCCAGGGACGGTACTTTCCCTCTCTACCTGGTGCTCCTCCCTAATGCTCATGAAT 
GGACCCCTCATGAATGAAACCAGTGCCCTTATAAGAGACCCCAAAGAGCTGCCTTGCCCTTC 
TGCAATGTGTGATCACAGCTAGAAGGCACTGTCAGAGAAGAGAAACTGGTCCTCACCAGATG 
CTGAATCTGCTGGTGCCTTGATCTTGGACTTCCCAGCCTCTAGAACTGTAAGAAATAAATAT 
TTGCTGTTTATAATCCAA 
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FIGURE 240 



MGSSSFLVLMVSLVLVTLVAVEGVKEGIEKAGVCPADNVRCFKSDPPQCHTDQDCLGERKCC 
YLHCGFKCVrPVKELEEGGNKDEDVSRPYPEPGWEAKCPGSSSTRCPQK 
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FIGURE 241 

AAACTCAGCACTTGCCGGAGTGGCTCATTGTTAAGACAAAGGGTGTGCACTTCCTGGCCAGG 
AAACCTGAGCGGTGAGACTCCCAGCTGCCTACATCAAGGCCCCAGGACATGCAGAACCTTCC 
TCTAGAACCCGACCCACCACCATGAGGTCCTGCCTGTGGAGATGCAGGCACCTGAGCCAAGG 
CGTCCAGTGGTCCTTGCTTCTGGCTGTCCTGGTCTTCTTTCTCTTCGCCTTGCCCTCTTTTA 
TTAAGGAGCCTCAAACAAAGCCTTCCAGGCATCAACGCACAGAGAACATTAAAGAAAGGTCT 
CTACAGTCCCTGGCAAAGCCTAAGTCCCAGGCACCCACAAGGGCGAGGAGGACAACCATCTA 
TGCAGAGCCAGCGCCAGAGAACAATGCCCTCAACACACAAACCCAGCCCAAGGCCCACACCA 
CCGGAGACAGAGGAAAGGAGGCCAACCAGGCACCGCCGGAGGAGCAGGACAAGGTGCCCCAC 
ACAGCACAGAGGGCAGCATGGAAGAGCCCAGAAAAAGAGAAAACCATGGTGAACACACTGTC 
ACCCAGAGGGCAAGATGCAGGGATGGCCTCTGGCAGGACAGAGGCACAATCATGGAAGAGCC 
AGGACACAAAGACGACCCAAGGAAATGGGGGCCAGACCAGGAAGCTGACGGCCTCCAGGACG 
GTGTCAGAGAAGCACCAGGGCAAAGCGGCAACCACAGCCAAGACGCTCATTCCCAAAAGTCA 
GCACAGAATGCTGGCTCCCACAGGAGCAGTGTCAACAAGGACGAGACAGAAAGGAGTGACCA 
CAGCAGTCATCCCACCTAAGGAGAAGAAACCTCAGGCCACCCCACCCCCTGCCCCTTTCCAG 
AGCCCCACGACGCAGAGAAACCAAAGACTGAAGGCCGCCAACTTCAAATCTGAGCCTCGGTG 
GGATTTTGAGGAAAAATACAGCTTCGAAATAGGAGGCCTTCAGACGACTTGCCCTGACTCTG 
TGAAGATCAAAGCCTCCAAGTCGCTGTGGCTCCAGAAACTCTTTCTGCCCAACCTCACTCTC 
TTCCTGGACTCCAGACACTTCAACCAGAGTGAGTGGGACCGCCTGGAACACTTTGCACCACC 
CTTTGGCTTCATGGAGCTCAACTACTCCTTGGTGCAGAAGGTCGTGACACGCTTCCCTCCAG 
TGCCCCAGCAGCAGCTGCTCCTGGCCAGCCTCCCCGCTGGGAGCCTCCGGTGCATCACCTGT 
GCCGTGGTGGGCAACGGGGGCATCCTGAACAACTCCCACATGGGCCAGGAGATAGACAGTCA 
CGACTACGTGTTCCGATTGAGCGGAGCTCTCATTAAAGGCTACGAACAGGATGTGGGGACTC 
GGACATCCTTCTACGGCTTTACCGCCTTCTCCCTGACCCAGTCACTCCTTATATTGGGCAAT 
CGGGGTTTCAAGAACGTGCCTCTTGGGAAGGACGTCCGCTACTTGCACTTCCTGGAAGGCAC 
CCGGGACTATGAGTGGCTGGAAGCACTGCTTATGAATCAGACGGTGATGTCAAAAAACCTTT 
TCTGGTTCAGGCACAGACCCCAGGAAGCTTTTCGGGAAGCCCTGCACATGGACAGGTACCTG 
TTGCTGCACCCAGACTTTCTCCGATACATGAAGAACAGGTTTCTGAGGTCTAAGACCCTGGA 
TGGTGCCCACTGGAGGATATACCGCCCCACCACTGGGGCCCTCCTGCTGCTCACTGCCCTTC 
AGCTCTGTGACCAGGTGAGTGCTTATGGCTTCATCACTGAGGGCCATGAGCGCTTTTCTGAT 
CACTACTATGATACATCATGGAAGCGGCTGATCTTTTACATAAACCATGACTTCAAGCTGGA 
GAGAGAAGTCTGGAAGCGGCTACACGATGAAGGGATAATCCGGCTGTACCAGCGTCCTGGTC 
CCGGAACTGCCAAAGCCAAGAACTGACCGGGGCCAGGGCTGCCATGGTCTCCTTGCCTGCTC 
CAAGGCACAGGATACAGTGGGAATCTTGAGACTCTTTGGCCATTTCCCATGGCTCAGACTAA 
GCTCCAAGCCCTTCAGGAGTTCCAAGGGAACACTTGAACCATGGACAAGACTCTCTCAAGAT 
GGCAAATGGCTAATTGAGGTTCTGAAGTTCTTCAGTACATTGCTGTAGGTCCTGAGGCCAGG 
GATTTTTAATTAAATGGGGTGATGGGTGGCCAATACCACAATTCCTGCTGAAAAACACTCTT 
CCAGTCCAAAAGCTTCTTGATACAGAAAAAAGAGCCTGGATTTACAGAAACATATAGATCTG 
GTTTGAATTCCAGATCGAGTTTACAGTTGTGAAATCTTGAAGGTATTACTTAACTTCACTAC 
AGATTGTCTAGAAGACCTTTCTAGGAGTTATCTGATTCTAGAAGGGTCTATACTTGTCCTTG 
TCTTTAAGCTATTTGACAACTCTACGTGTTGTAGAAAACTGATAATAATACAAATGATTGTT 
GTCCATGGAAAGGCAAATAAATTTTCTACAGTGAAAAAAAAAAAAAAA 
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FIGURE 242 

MRSCLWRCRHLSQGVQWSLLLAVLVFFLFALPSFIKEPQTKPSRHQRTENIKERSLQSLAKP 

KSQAPTRARRTTIYAEPAPE^ALNTQTQPKAHTTGDRGKEANQAPPEEQDKVPHTAQRAAW 

KSPEKEKT^1VNTLSPRGQDAGMASGRTEAQSWKSQDTKTTQGNGGQTRKLTASRTVSEKHQG 

KAATTAKTLI PKSQHRMLAPTGAVSTRTRQKGVTTAVI P P KE KK PQ AT P P PAP FQS PTTQRN 

QRLKAANFKSEPRWDFEEKYSFEIGGLQTTCPDSVKIKASKSLWLQKLFLPNLTLFLDSRHF 

NQSEWDRLEHFAPPFGFMELNYSLVQKVVTRFPPVPQQQLLIiASLPAGSLRCITCAVVGNGG 

ILNNSHMGQEIDSHDYVFRLSGALIKGYEQDVGTRTSFYGFTAFSLTQSLLILGNRGFKNVP 

LGKDWYLHFLEGTRDYEWLEALLMNQTVMSKNLFWFRHRPQEAFREALHMDRYL 

RYMKNRFLRSKTLDGAHWRIYRPTTGALLLLTALQLCDQVSAYGFITEGHERFSDHYYDTSW 

KRLIFYINHDFKLEREVWKRLHDEGIIRLYQRPGPGTAKAKN 
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FIGURE 243 

CGATGCGCGGACCCGGGCACCCCCTCCTCCTGGGGCTGCTGCTGGTGCTGGGGCCTTCGCCG 
GAGCAGCGAGTGGAAATTGTTCCTCGAGATCTGAGGATGAAGGACAAGTTTCTAAAACACCT 
TACAGGCCCTCTTTATTTTAGTCCAAAGTGCAGCAAACACTTCCATAGACTTTATCACAACA 
CCAGAGACTGCACCATTCCTGCATACTATAAAAGATGCGCCAGGCTTCTTACCCGGCTGGCT 
GT CAGTCCAGTGTGCATGGAGGAT AAGTGAGC AG AC CGTACAGG AG CAGC AC AC CAGGAG C C 
ATGAGAAGTGCCTTGGAAACCAACAGGGAAACAGAACTATCTTTATACACATCCCCTCATGG 
ACAAGAGATTTATTTTTGCAGACAGACTCTTCCATAAGTCCTTTGAGTTTTGTATGTTGTTG 
ACAGTTTGCAGATATATATTCGATAAATCAGTGTACTTGACAGTGTTATCTGTCACTTATTT 
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FIGURE 244 

MRGPGHPLLLGLLLVLGPS PEQRVE I VPRDLRMKDKFLKHLTGPLYFS PKCSKHFHRLYHNT 
RDCTI PAYYKRCARLLTRLAVS PVCMEDK 
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FIGURE 245 

GGGCTGGGCCCCGCCGCAGCTCCAGCTGGCCGGCTTGGTCCTGCGGTCCCTTCTCTGGGAGG 
CCCGACCCCGGCCGCGCCCAGCCCCCACCATGCCACCCGCGGGGCTCCGCCGGGCCGCGCCG 
CTCACCGCAATCGCTCTGTTGGTGCTGGGGGCTCCCCTGGTGCTGGCCGGCGAGGACTGCCT 
GTGGTACCTGGACCGGAATGGCTCCTGGCATCCGGGGTTTAACTGCGAGTTCTTCACCTTCT 
GCTGCGGGACCTGCTACCATCGGTACTGCTGCAGGGACCTGACCTTGCTTATCACCGAGAGG 
CAGCAGAAGCACTGCCTGGCCTTCAGCCCCAAGACCATAGCAGGCATCGCCTCAGCTGTGAT 
CCTCTTTGTTGCTGTGGTTGCCACCACCATCTGCTGCTTCCTCTGTTCCTGTTGCTACCTGT 
ACCGCCGGCGCCAGCAGCTCCAGAGCCCATTTGAAGGCCAGGAGATTCCAATGACAGGCATC 
CCAGTGCAGCCAGTATACCCATACCCCCAGGACCCCAAAGCTGGCCCTGCACCCCCACAGCC 
TGGCTTCATGTACCCACCTAGTGGTCCTGCTCCCCAATATCCACTCTACCCAGCTGGGCCCC 
CAGTCTACAACCCTGCAGCTCCTCCTCCCTATATGCCACCACAGCCCTCTTACCCGGGAGCC 
TGAGGAACCAGCCATGTCTCTGCTGCCCCTTCAGTGATGCCAACCTTGGGAGATGCCCTCAT 
CCTGTACCTGCATCTGGTCCTGGGGGTGGCAGGAGTCCTCCAGCCACCAGGCGCCAGACCAA 
GCCAAGCCCTGGGCCCTACTGGGGACAGAGCCCCAGGGAAGTGGAACAGGAGCTGAACTAGA 
ACTATGAGGGGTTGGGGGGAGGGCTTGGAATTATGGGCTATTTTTACTGGGGGCAAGGGAGG 
GAGATGACAGCCTGGGTCACAGTGCCTGTTTTCAAATAGTCCCTCTGCTCCCAAGATCCCAG 
CCAGGAAGGCTGGGGCCCTACTGTTTGTCCCCTCTGGGCTGGGGTGGGGGGAGGGAGGAGGT 
TCCGTCAGCAGCTGGCAGTAGCCCTCCTCTCTGGCTGCCCCACTGGCCACATCTCTGGCCTG 
CTAGATTAAAGCTGTAAAGACAAAA 
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FIGURE 246 

MPPAGLRRAAPLTAIALLVLGAPLVLAGEDCLWYLDRNGSWHPGFNCEFFTFCCGTCYHRYC 
CRDLTLLITERQQKHCLAFSPKTIAGIASAVILFVAWATTICCFLCSCCYLYRRRQQLQSP 
FEGQEIPMTGIPVQPVYPYPQDPKAGPAPPQPGFMYPPSGPAPQYPLYPAGPPVYNPAAPPP 
YMPPQPSYPGA 
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FIGURE 247 

GGGGGAGCTAGGCCGGCGGCAGTGGTGGTGGCGGCGGCGCAAGGGTGAGGGCGGCCCCAGAA 
CCCCAGGTAGGTAGAGCAAGAAGATGGTGTTTCTGCCCCTCAAATGGTCCCTTGCAACCATG 
TCATTTCTACTTTCCTCACTGTTGGCTCTCTTAACTGTGTCCACTCCTTCATGGTGTCAGAG 
CACTGAAGCATCTCCAAAACGTAGTGATGGGACACCATTTCCTTGGAATAAAATACGACTTC 
CTGAGTACGTCATCCCAGTTCATTATGATCTCTTGATCCATGCAAACCTTACCACGCTGACC 
TTCTGGGGAACCACGAAAGTAGAAATCACAGCCAGTCAGCCCACCAGCACCATCATCCTGCA 
TAGTCACCACCTGCAGATATCTAGGGCCACCCTCAGGAAGGGAGCTGGAGAGAGGCTATCGG 
AAGAACCCCTGCAGGTCCTGGAACACCCCCCTCAGGAGCAAATTGCACTGCTGGCTCCCGAG 
CCCCTCCTTGTCGGGCTCCCGTACACAGTTGTCATTCACTATGCTGGCAATCTTTCGGAGAC 
TTTCCACGGATTTTACAAAAGCACCTACAGAACCAAGGAAGGGGAACTGAGGATACTAGCAT 
CAACACAATTTGAACCCACTGCAGCTAGAATGGCCTTTCCCTGCTTTGATGAACCTGCCTTC 
AAAGCAAGTTTCTCAATCAAAATTAGAAGAGAGCCAAGGCACCTAGCCATCTCCAATATGCC 
ATTGGTGAAATCTGTGACTGTTGCTGAAGGACTCATAGAAGACCATTTTGATGTCACTGTGA 
AGATGAGCACCTATCTGGTGGCCTTCATCATTTCAGATTTTGAGTCTGTCAGCAAGATAACC 
AAGAGTGGAGTCAAGGTTTCTGTTTATGCTGTGCCAGACAAGATAAATCAAGCAGATTATGC 
ACTGGATGCTGCGGTGACTCTTCTAGAATTTTATGAGGATTATTTCAGCATACCGTATCCCC 
TACCCAAACAAGATCTTGCTGCTATTCCCGACTTTCAGTCTGGTGCTATGGAAAACTGGGGA 
CTGACAACATATAGAGAATCTGCTCTGTTGTTTGATGCAGAAAAGTCTTCTGCATCAAGTAA 
GCTTGGCATCACAGTGACTGTGGCCCATGAACTGGCCCACCAGTGGTTTGGGAACCTGGTCA 
CTATGGAATGGTGGAATGATCTTTGGCTAAATGAAGGATTTGCCAAATTTATGGAGTTTGTG 
TCTGTCAGTGTGACCCATCCTGAACTGAAAGTTGGAGATTATTTCTTTGGCAAATGTTTTGA 
CGCAATGGAGGTAGATGCTTTAAATTCCTCACACCCTGTGTCTACACCTGTGGAAAATCCTG 
CTCAGATCCGGGAGATGTTTGATGATGTTTCTTATGATAAGGGAGCTTGTATTCTGAATATG 
CTAAGGGAGTATCTTAGCGCTGACGCATTTAAAAGTGGTATTGTACAGTATCTCCAGAAGCA 
TAGCTATAAAAATACAAAAAACGAGGACCTGTGGGATAGTATGGCAAGTATTTGCCCTACAG 
ATGGTGTAAAAGGGATGGATGGCTTTTGCTCTAGAAGTCAACATTCATCTTCATCCTCACAT 
TGGCATCAGGAAGGGGTGGATGTGAAAACCATGATGAACACTTGGACACTGCAGAGGGGTTT 
TCCCCTAATAACCATCACAGTGAGGGGGAGGAATGTACACATGAAGCAAGAGCACTACATGA 
AGGGCTCTGACGGCGCCCCGGACACTGGGTACCTGTGGCATGTTCCATTGACATTCATCACC 
AGCAAATCCAACATGGTCCATCGATTTTTGCTAAAAACAAAAACAGATGTGCTCATCCTCCC 
AGAAGAGGTGGAATGGATCAAATTTAATGTGGGCATGAATGGCTATTACATTGTGCATTACG 
AGGATGATGGATGGGACTCTTTGACTGGCCTTTTAAAAGGAACACACACAGCAGTCAGCAGT 
AATGATCGGGCAAGTCTCATTAACAATGCATTTCAGCTCGTCAGCATTGGGAAGCTGTCCAT 
TGAAAAGGCCTTGGATTTATCCCTGTACTTGAAACATGAAACTGAAATTATGCCCGTGTTTC 
AAGGTTTGAATGAGCTGATTCCTATGTATAAGTTAATGGAGAAAAGAGATATGAATGAAGTG 
GAAACTCAATTCAAGGCCTTCCTCATCAGGCTGCTAAGGGACCTCATTGATAAGCAGACATG 
GACAGACGAGGGCTCAGTCTCAGAGCAAATGCTGCGGAGTGAACTACTACTCCTCGCCTGTG 
TGCACAACTATCAGCCGTGCGTACAGAGGGCAGAAGGCTATTTCAGAAAGTGGAAGGAATCC 
AATGGAAACTTGAGCCTGCCTGTCGACGTGACCTTGGCAGTGTTTGCTGTGGGGGCCCAGAG 
CACAGAAGGCTGGGATTTTCTTTATAGTAAATATCAGTTTTCTTTGTCCAGTACTGAGAAAA 
GCCAAATTGAATTTGCCCTCTGCAGAACCCAAAATAAGGAAAAGCTTCAATGGCTACTAGAT 
GAAAGCTTTAAGGGAGATAAAATAAAAACTCAGGAGTTTCCACAAATTCTTACACTCATTGG 
CAGGAACCCAGTAGGATACCCACTGGCCTGGCAATTTCTGAGGAAAAACTGGAACAAACTTG 
TACAAAAGTTTGAACTTGGCTCATCTTCCATAGCCCACATGGTAATGGGTACAACAAATCAA 
TTCTCCACAAGAACACGGCTTGAAGAGGTAAAAGGATTCTTCAGCTCTTTGAAAGAAAATGG 
TTCTCAGCTCCGTTGTGTCCAACAGACAATTGAAACCATTGAAGAAAACATCGGTTGGATGG 
ATAAGAATTTTGATAAAATCAGAGTGTGGCTGCAAAGTGAAAAGCTTGAACGTATG TAA AAA 
TTCCTCCCTTGCCCGGTTCCTGTTATCTCTAATCACCAACATTTTGTTGAGTGTATTTTCAA 
ACTAGAGATGGCTGTTTTGGCTCCAACTGGAGATACTTTTTTCCCTTCAACTCATTTTTTGA 
CTATCCCTGTGAAAAGAATAGCTGTTAGTTTTTCATGAATGGGCTTTTTCATGAATGGGCTA 
TCGCTACCATGTGTTTTGTTCATCACAGGTGTTGCCCTGCAACGTAAACCCAAGTGTTGGGT 
TCCCTGCCACAGAAGAATAAAGTACCTTATTCTTCTC^AAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 248 

WFLPLKWSLATMSFLLSSLLAL^ 

YDLLIHANLTTLTFWGTTKVEITASQPTSTIILHSHHLQISRATLRKGAGERLSEEPLQVLE 
HPPQEQIALLAPEPLLVGLPYTWIHYAGNLSETFHGFYKSTYRTKEGELRILASTQFEPTA 
ARMAFPCFDE PAFKAS FS I K I RRE PRHLAI SNM PLVKS VTVAEGL I EDHFDVTVKMSTYLVA 
FIISDFESVSKITKSGVKVSVYAVPDKINQADYALDAAVTLLEFYEDYFSIPYPLPKQDLAA 
IPDFQSGAMENWGLTTYRESALLFDAEKSSASSKLGITVTVAHELAHQWFGNLVTMEWWND^ 
WLNEGFAKFMEFVSVSVTHPELKVGDYFFGKCFDAMEVDALNSSHPVSTPVENPAQIREMFD 
DVSYDKGACILNMLREYLSADAFKSGIVQYLQKHSYKNTKNEDLWDSMASICPTDGVKGMDG 
FCSRSQHS S S SSHWHQEGVD VKTMMNTWTLQRGFPL I T I TVRGRNVHMKQEH YMKGSDGAPD 
TGYLWHVPLTFITSKSNMVHRFLLKTKTDVLILPEEVEWIKFNVGMNGYYIVHYEDDGWDSL 
TGLLKGTHTAVSSNDRASLINNAFQLVSIGKLSIEKALDLSLYLKHETEIMPVFQGLNELIP 
I^KLMEKRDI^EVETQFKAFLIRLL 

QRAEGYFRKWKESNGNLSLPVDVTLAVFAVGAQSTEGWDFLYSKYQFSLSSTEKSQIEFALC 
RTQNKEKLQWLLDESFKGDKIKTQEFPQILTLIGRNPVGYPLAWQFLRKNWNKLVQKFELGS 
SSIAHMVMGTTNQFSTRTRLEEVKGFFSSLKENGSQLRCVQQTIETIEENIGWMDKNFDKIR 
VWLQSEKLERM 
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FIGURE 249 

CAGCCACAGACGGGTCATGAGCGCGGTATTACTGCTGGCCCTCCTGGGGTTCATCCTCCCAC 
TGCCAGGAGTGCAGGCGCTGCTCTGCCAGTTTGGGACAGTTCAGCATGTGTGGAAGGTGTCC 
GACCTACCCCGGCAATGGACCCCTAAGAACACCAGCTGCGACAGCGGCTTGGGGTGCCAGGA 
CACGTTGATGCTCATTGAGAGCGGACCCCAAGTGAGCCTGGTGCTCTCCAAGGGCTGCACGG 
AGGCCAAGGACCAGGAGCCCCGCGTCACTGAGCACCGGATGGGCCCCGGCCTCTCCCTGATC 
TCCTACACCTTCGTGTGCCGCCAGGAGGACTTCTGCAACAACCTCGTTAACTCCCTCCCGCT 
TTGGGCCCCACAGCCCCCAGCAGACCCAGGATCCTTGAGGTGCCCAGTCTGCTTGTCTATGG 
AAGGCTGTCTGGAGGGGACAACAGAAGAGATCTGCCCCAAGGGGACCACACACTGTTATGAT 
GGCCTCCTCAGGCTCAGGGGAGGAGGCATCTTCTCCAATCTGAGAGTCCAGGGATGCATGCC 
CCAGCCAGGTTGCAACCTGCTCAATGGGACACAGGAAATTGGGCCCGTGGGTATGACTGAGA 
ACTGCAATAGGAAAGATTTTCTGACCTGTCATCGGGGGACCACCATTATGACACACGGAAAC 
TTGGCTCAAGAACCCACTGATTGGACCACATCGAATACCGAGATGTGCGAGGTGGGGCAGGT 
GTGTCAGGAGACGCTGCTGCTCATAGATGTAGGACTCACATCAACCCTGGTGGGGACAAAAG 
GCTGCAGCACTGTTGGGGCTCAAAATTCCCAGAAGACCACCATCCACTCAGCCCCTCCTGGG 
GTGCTTGTGGCCTCCTATACCCACTTCTGCTCCTCGGACCTGTGCAATAGTGCCAGCAGCAG 
CAGCGTTCTGCTGAACTCCCTCCCTCCTCAAGCTGCCCCTGTCCCAGGAGACCGGCAGTGTC 
CTACCTGTGTGCAGCCCCTTGGAACCTGTTCAAGTGGCTCCCCCCGAATGACCTGCCCCAGG 
GGCGCCACTCATTGTTATGATGGGTACATTCATCTCTCAGGAGGTGGGCTGTCCACCAAAAT 
GAGCATTCAGGGCTGCGTGGCCCAACCTTCCAGCTTCTTGTTGAACCACACCAGACAAATCG 
GGATCTTCTCTGCGCGTGAGAAGCGTGATGTGCAGCCTCCTGCCTCTCAGCATGAGGGAGGT 
GGGGCTGAGGGCCTGGAGTCTCTCACTTGGGGGGTGGGGCTGGCACTGGCCCCAGCGCTGTG 
GTGGGGAGTGGTTTGCCCTTCCTGCTAACTCTATTACCCCCACGATTCTTCACCGCTGCTGA 
CCACCCACACTCAACCTCCCTCTGACCTCATAACCTAATGGCCTTGGACACCAGATTCTTTC 
CCATTCTGTCCATGAATCATCTTCCCCACACACAATCATTCATATCTACTCACCTAACAGCA 
ACACTGGGGAGAGCCTGGAGCATCCGGACTTGCCCTATGGGAGAGGGGACGCTGGAGGAGTG 
GCTGCATGTATCTGATAATACAGACCCTGTCCTTTCA 
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MSAVLLLALLGFILPLPGVQALLCQFGTVQHVWKVSDLPRQWTPKNTSCDSGLGCQDTLMLI 
ESGPQVSLVLSKGCTEAKDQEPRVTEHRMGPGLSLISYTFVCRQEDFCNNLVNSLPLWAPQP 
PADPGSLRCP VCLSMEGCLEGTTEE I CPKGTTHCYDGLLRLRGGG I F SNLRVQGCMPQPGCN 
LLNGTQEIGPVGMTENCNRKDFLTCHRGTTIMTHGNLAQEPTDWTTSNTEMCEVGQVCQETL 
LLIDVGLTSTLVGTKGCSTVGAQNSQKTTIHSAPPGVLVASYTHFCSSDLCNSASSSSVLLN 
SLPPQAAPVPGDRQCPTCVQPLGTCSSGSPRMTCPRGATHCYDGYIHLSGGGLSTKMSIQGC 
VAQPSSFLLNHTRQIGIFSAREKRDVQPPASQffi^ 
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FIGURE 251 

GCGACGGGCAGGACGCCCCGTTCGCCTAGCGCGTGCTCAGGAGTTGGTGTCCTGCCTGCGCT 
CAGGATSAGGGGGAATCTGGCCCTGGTGGGCGTTCTAATCAGCCTGGCCTTCCTGTCACTGCTG 
CCATCTGGACATCCTCAGCCGGCTGGCGATGACGCCTGCTCTGTGCAGATCCTCGTCCCTGG 
CCTCAAAGGGGATGCGGGAGAGAAGGGAGACAAAGGCGCCCCCGGACGGCCTGGAAGAGTCG 
GCCCCACGGGAGAAAAAGGAGACATGGGGGACAAAGGACAGAAAGGCAGTGTGGGTCGTCAT 
GGAAAAATTGGTCCCATTGGCTCTAAAGGTGAGAAAGGAGATTCCGGTGACATAGGACCCCC 
TGGTCCTAATGGAGAACCAGGCCTCCCATGTGAGTGCAGCCAGCTGCGCAAGGCCATCGGGG 
AGATGGACAACCAGGTCTCTCAGCTGACCAGCGAGCTCAAGTTCATCAAGAATGCTGTCGCC 
GGTGTGCGCGAGACGGAGAGCAAGATCTACCTGCTGGTGAAGGAGGAGAAGCGCTACGCGGA 
CGCCCAGCTGTCCTGCCAGGGCCGCGGGGGCACGCTGAGCATGCCCAAGGACGAGGCTGCCA 
ATGGCCTGATGGCCGCATACCTGGCGCAAGCCGGCCTGGCCCGTGTCTTCATCGGCATCAAC 
GACCTGGAGAAGGAGGGCGCCTTCGTGTACTCTGACCACTCCCCCATGCGGACCTTCAACAA 
GTGGCGCAGCGGTGAGCCCAACi^ATGCCTACGACGAGGAGGACTGCGTGGAGATGGTGGCCT 
CGGGCGGCTGGAACGACGTGGCCTGCCACACCACCATGTACTTCATGTGTGAGTTTGACAAG 
GAGAACATGTGAGCCTCAGGCTGGGGCTGCCCATTGGGGGCCCCACATGTCCCTGCAGGGTT 
GGCAGGGACAGAGCCCAGACCATGGTGCCAGCCAGGGAGCTGTCCCTCTGTGAAGGGTGGAG 
GCTCACTGAGTAGAGGGCTGTTGTCTAAACTGAGAAAATGGCCTATGCTTAAGAGGAAAATG 
AAAGTGTTCCTGGGGTGCTGTCTCTGAAGAAGCAGAGTTTCATTACCTGTATTGTAGCCCCA 
ATGTCATTATGTAATTATTACCCAGAATTGCTCTTCCATAAAGCTTGTGCCTTTGTCCAAGC 
TATACAATAAAATCTTTAAGTAGTGCAGTAGTTAAGTCCAAAAAAAAAAAAAAAAAAA 
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FIGURE 252 

MRGNLALVGVLI SLAFLSLLPSGHPQPAGDDACS VQ I LVPGLKGDAGEKGDKGAPGRPGRVG 
PTGEKGDMGDKGQKGSVGRHGKIGPIGSKGEKGDSGDIGPPGPNGEPGLPCECSQLRKAIGE 
MDNQVSQLTSELKFIKNAVAGVRETESKIYLLVKEEKRYADAQLSCQGRGGTLSMPKDEAAN 
GLMAAYLAQAGLARVFIGINDLEKEGAFVYSDHSPMRTFNKWRSGEPNNAYDEEDCVEIWAS 
GGWNDVACHTTMYFMCEFDKENM 
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FIGURE 253 

AGTGACTGCAGCCTTCCTAGATCCCCTCCACTCGGTTTCTCTCTTTGCAGGAGCACCGGCAG 
CACCAGTGTGTGAGGGGAGCAGGCAGCGGTCCTAGCCAGTTCCTTGATCCTGCCAGACCACC 
C7VGCCCCCGGCACAGAGCTGCTCCACAGGCACCATCAGGATCATGCTGCTATTCACAGCCAT 
CCTGGCCTTCAGCCTAGCTCAGAGCTTTGGGGCTGTCTGTAAGGAGCCACAGGAGGAGGTGG 
TTCCTGGCGGGGGCCGCAGCAAGAGGGATCCAGATCTCTACCAGCTGCTCCAGAGACTCTTC 
AAAAGCCACTCATCTCTGGAGGGATTGCTCAAAGCCCTGAGCCAGGCTAGCACAGATCCTAA 
GGAATCAACATCTCCCGAGAAACGTGACATGCATGACTTCTTTGTGGGACTTATGGGCAAGA 
GGAGCGTCCAGCCAGAGGGAAAGACAGGACCTTTCTTACCTTCAGTGAGGGTTCCTCGGCCC 
CTTCATCCCAATCAGCTTGGATCCACAGGAAAGTCTTCCCTGGGAACAGAGGAGCAGAGACC 
TTTATAAGACTCTCCTACGGATGTGAATCAAGAGAACGTCCCCAGCTTTGGCATCCTCAAGT 
ATCCCCCGAGAGCAGAATAGGTACTCCACTTCCGGACTCCTGGACTGCATTAGGAAGACCTC 
TTTCCCTGTCCCAATCCCCAGGTGCGCACGCTCCTGTTACCCTTTCTCTTCCCTGTTCTTGT 
AACATTCTTGTGCTTTGACTCCTTCTCCATCTTTTCTACCTGACCCTGGTGTGGAAACTGCA 
TAGTGAATATCCCCAACCCCAATGGGCATTGACTGTAGAATACCCTAGAGTTCCTGTAGTGT 
CCTACATTAAAAATATAATGTCTCTCTCTATTCCTCAACAATAAAGGATTTTTGCATATGAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 254 

MRIMLLFTAILAFSLAQSFGAVCKEPQEEWPGGGRSKRDPDLYQLLQRLFKSHSSLEGLLK 
ALSQASTDPKESTSPEKRDMHDFFVGLMGKRSVQPEGKTGPFLPSVRVPRPLHPNQLGSTGK 
SSLGTEEQRPL 
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FIGURE 255 

GGGCGTCTCCGGCTGCTCCTATTGAGCTGTCTGCTCGCTGTGCCCGCTGTGCCTGCTGTGCC 
CGCGCTGTCGCCGCTGCTACCGCGTCTGCTGGACGCGGGAGACGCCAGCGAGCTGGTGATTG 
GAGCCCTGCGGAGAGCTCAAGCGCCCAGCTCTGCCCCAGGAGCCCAGGCTGCCCCGTGAGTC 
CCATAGTTGCTGCAGGAGTGGAGCCATGAGCTGCGTCCTGGGTGGTGTCATCCCCTTGGGGC 
TGCTGTTCCTGGTCTGCGGATCCCAAGGCTACCTCCTGCCCAACGTCACTCTCTTAGAGGAG 
CTGCTCAGCAAATACCAGCACAACGAGTCTCACTCCCGGGTCCGCAGAGCCATCCCCAGGGA 
GGACAAGGAGGAGATCCT CATGCTGCACAACAAG CTTCGGGG C CAGGTGCAGCCTCAGGCCT 
CCAACATGGAGTACATGGTGAGCGCCGGCTCCGGCCGCAGAGGCTGGCACCGGGGGTGGGGC 
CTGGGCCACCAGCCTGCTCTGTTCCCCAGCCAGCTCTGTTCCCCAGCCAGTGCGTGTGATGG 
CTGGCTCAGGGTCTCCTCTGGCAGGGGAGGATCCCGGCTCTGTTCTGTTTTGTTTGTTTGTT 
TTGAGACAGGGTCTCACTCTGCCACTGACGCTGGAGTGCAATGGCACAATCGTCATGCCCTG 
AAACCTTAGACTCCCGGGGTTAAGCGATCCTGCTTCAGCCTCCCAAGTAGCTGGAACTACAG 
GCATGCACCATGGTGCCCAGCTAGATTTTAAATATTTTGTGGAGATGGGGGTCTTGCTACGT 
TGCCCAGGCTGGTCTTGAACTCCTAGGCTCAAGCAATCCTCCTGCCTCAGCCTCTCAAAGTG 
CTAGGATTATAGGCATGAGTCACCCTGTCTGGCTCTGGCTCTGTTCTTAACATTCTGCCAAA 
ACAACACACGTGGGTTCCCTGTGCAGAGCCTGCCTCGTTGCCTTCATGTCACTCTTGGTAGC 
TCCACTGGGAACACAGCTCTCAGCCTTTCCCACCTGGAGGCAGAGTGGGGAGGGGCCCAGGG 
CTGGGCTTTGCTGATGCTGATCTCAGCTGTGCCACACGCTAGCTGCACCACCCTGACTTCTC 
CTTAGCCCGTGTGAGCCTCACTTTCCACTTGGAGAGTCCTTCCTCGCGTGGTTGCCATGACT 
GTGAGATAAGTCGAGGCTGTGAAGGGCCCGGCACAGACTGACCTGCCTCCCCAACCCCTAGG 
CTTTGGTAACCGGGAAAGGAGCTAACGGTGACAGAAGACAGCCAAGGTCAACCCTCCCGGGT 
GATTGTGATGGGTGTTCCAGGTGTGGTTGGGCGATGCTGCTACTTGACCCCAAGCTCCAGTG 
TGGAAACTTCCTTCCTGGCTGGTTTTCCAGAACTACAGAGGAATGGACCACAGTCTTCCAGG 
GTCCCTCCTCGTCCACCAACCGGGAGCCTCCACCTTGGCCATCCGTCAGCTATGAATGGCTT 
TTTAAACAAACCCACGTCCCAGCCTGGGTAACATGGTAAAGCCCCGTCTCTACAAAAAAATC 
CAAGTTAGCCGGGCATGGTGGTGCGCACCTGTAGTCCCAGCTGCAGTGGGACTGAGGTGGAG 
GTGGAGGTGGGGGGTGGGAGCTGAGGAAGGAGGATCGCTTGAGCCTGGGAAGTCGAGGCTGC 
AGTGAGCTGAGATTGCACCACTGCACTCCAGCCTGGGTGACAGAGCAAGACCCTGTCTCAAAAA 
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FIGURE 256 



MSCVLGGVIPLGLLFLVCGSQGYLLPNVTLLEELLSKYQHl^SHSRVRRAIPREDKEEILML 
HNKLRGQVQPQASNMEYMVSAGSGRRGWHRGWGLGHQPALFPSQLCSPASACDGWLRVSSGR 
GGSRLCSVLFVCFETGSHSATDAGVQWHNRHALKP 
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FIGURE 257 

AAGGAGAGGCCACCGGGACTTCAQTGTCTCCTCCATCCCAGGAGCGCAGTQGCCACT ATG GG 
GTCTGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGG 
GTATGACTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGC 
TTCCTGGAATTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCAC 
CCTCCACCATGCAAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTG 
TGTCCTTCTTGGCCCGGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTT 
CAGCAGGCCCCCACCCTCCTGAGTGGCAATAAATAAAATTCGGTATGCTG 
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FIGURE 258 

MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTS 
VTLHHARSQHHWCNT 



WO 99/63088 °€^5^0 PCTAJS99/122S2 

FIGURE 259 

AATTGTATCTGTGTAATGTTAAAACAAACGAAATAAAATAGAAGGAAAAACTTTCTGAGTTT 
CAAAAACAACAGACTAGTACTCTAAAGAACTCTTTAAAACAATTAACTGTTAGGATTGCAGT 
TATGATTGGATATTATTTAATTCTGTTTCTGATGTGGGGTTCCTCCACTGTGTTCTGTGTGC 
TATTAATATTTACCATTGCAGAAGCTTCATTCAGTGTTGAAAATGAATGCTTAGTGGATCTG 
TGCCTCTTACGCATATGTTACAAATTATCTGGAGTTCCTAATCAATGCAGAGTTCCCCTCCC 
CTCCGATTGTTCTAAATAATTGAAAGATGTCTGCTGTGGAAAAAGGCATGTATTTAAATCTG 
TATGATTCTCAACCATCTTTAGTTGGGAAAGGTCCTTGAAAGCCAATGGAAATACTTTTTTT 
TTTTCTTGGCACTAATCAAGTGAGTGTTACCTTTTCACTTAGTAGGATGTGTTGTTACGCTA 
GTAAAATAGAAACCTGTGTTTATTCTCAGGTATTTTAGAAACAACAGCCATCATTTTATTTT 
ATGTGTGTGTTCTTGGCTGTATTCATAAATTATATATTTTGGGCTATCAAATATTACTTCAT 
TCAATATAAATAACAATAGTAGAAGTTGTTTACTTAGATATGCTTTCTAGTTGCATTTTCTC 
AGCCTATGTAAGACTACTTTGTTGTAATAGCCTTTGAAATTTACAGTACTGTCTCTCTACTA 
TCTTCAGATTACTTGATTCAAATAAACCAATTATGTTTGTAATTGATATTAATAAAACCAGA 
ATAAAAGTTCATATCTACCC 
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FIGURE 260 

MIGYYLILFLMWGSSTVFCVLLIFTIAEASFSVENECLVDLCLLRICYKLSGVPNQCRVPLP 
SDCSK 
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GAGGATTTGCCACAGCAGCGGATAGAGCAGGAGAGCACCACCGGAGCCCTTGAGACATCCTT 
GAGAAGAGCCACAGCATAAGAGACTGCCCTGCTTGGTGTTTTGCAGGATGATGGTGGCCCTT 
CGAGGAGCTTCTGCATTGCTGGTTCTGTTCCTTGCAGCTTTTCTGCCCCCGCCGCAGTGTAC 
CCAGGACCCAGCCATGGTGCATTACATCTACCAGCGCTTTCGAGTCTTGGAGCAAGGGCTGG 
AAAAATGTACCCAAGCAACGAGGGCATAGATTCAAGAATTCCAAGAGTTCTCAAAAAATATA 
TCTGTCATGCTGGGAAGATGTCAGACCTACACAAGTGAGTACAAGAGTGCAGTGGGTAACTT 
GGCACTGAGAGTTGAACGTGCCCAACGGGAGATTGACTACATACAATACCTTCGAGAGGCTG 
ACGAGTGCATCGTATCAGAGGACAAGACACTGGCAGAAATGTTGCTCCAAGAAGCTGAAGAA 
GAGAAAAAGATCCGGACTCTGCTGAATGCAAGCTGTGACAACATGCTGATGGGCATAAAGTC 
TTTGAAAATAGTGAAGAAGATGATGGACACACATGGCTCTTGGATGAAAGATGCTGTCTATA 
ACTCTCCAAAGGTGTACTTATTAATTGGATCCAGAAACAACACTGTTTGGGAATTTGCAAAC 
ATACGGGCATTCATGGAGGATAACACCAAGCCAGCTCCCCGGAAGCAAATCCTAACACTTTC 
CTGGCAGGGAACAGGCCAAGTGATCTACAAAGGTTTTCTATTTTTTCATAACCAAGCAACTT 
CTAATGAGATAATCAAATATAACCTGCAGAAGAGGACTGTGGAAGATCGAATGCTGCTCCCA 
GGAGGGGTAGGCCGAGCATTGGTTTACCAGCACTCCCCCTCAACTTACATTGACCTGGCTGT 
GGATGAGCATGGGCTCTGGGCCATCCACTCTGGGCCAGGCACCCATAGCCATTTGGTTCTCA 
CAAAGATTGAGCCGGGCACACTGGGAGTGGAGCATTCATGGGATACCCCATGCAGAAGCCAG 
GATGCTGAAGCCTCATTCCTCTTGTGTGGGGTTCTCTATGTGGTCTACAGTACTGGGGGCCA 
GGGCCCTCATCGCATCACCTGCATCTATGATCCACTGGGCACTATCAGTGAGGAGGACTTGC 
CCAACTTGTTCTTCCCCAAGAGACCAAGAAGTCACTCCATGATCCATTACAACCCCAGAGAT 
AAGCAGCTCTATGCCTGGAATGAAGGAAACCAGATCATTTACAAACTCCAGACAAAGAGAAA 
GCTGCCTCTGAAGTAATGCATTACAGCTGTGAGAAAGAGCACTGTGGCTTTGGCAGCTGTTC 
TACAGGACAGTGAGGCTATAGCCCCTTCACAATATAGTATCCCTCTAATCACACACAGGAAG 
AGTGTGTAGAAGTGGAAATACGTATGCCTCCTTTCCCAAATGTCACTGCCTTAGGTATCTTC 
CAAGAGCTTAGATGAGAGCATATCATCAGGAAAGTTTCAACAATGTCCATTACTCCCCCAAA 
CCTCCTGGCTCTCAAGGATGACCACATTCTGATACAGCCTACTTCAAGCCTTTTGTTTTACT 
GCTCCCCAGCATTTACTGTAACTCTGCCATCTTCCCTCCCACAATTAGAGTTGTATGCCAGC 
CCCTAATATTCACCACTGGCTTTTCTCTCCCCTGGCCTTTGCTGAAGCTCTTCCCTCTTTTT 
CAAATGTCTATTGATATTCTCCCATTTTCACTGCCCAACTAAAATACTATTAATATTTCTTT 
CTTTTCTTTTCTTTTTTTTGAGACAAGGTCTCACTATGTTGCCCAGGCTGGTCTCAAACTCC 
AGAGCTCAAGAGATCCTCCTGCCTCAGCCTCCTAAGTACCTGGGATTACAGGCATGTGCCAC 
CACACCTGGCTTAAAATACTATTTCTTATTGAGGTTTAACCTCTATTTCCCCTAGCCCTGTC 
CTTCCACTAAGCTTGGTAGATGTAATAATAAAGTGAAAATATTAACATTTGAATATCGCTTT 
CCAGGTGTGGAGTGTTTGCACATCATTGAATTCTCGTTTCACCTTTGTGAAACATGCACAAG 
TCTTTACAGCTGTCATTCTAGAGTTTAGGTGAGTAACACAATTACAAAGTGAAAGATACAGC 
TAGAAAATACTACAAATCCCATAGTTTTTCCATTGCCCAAGGAAGCATCAAATACGTATGTT 
TGTTCACCTACTCTTATAGTCAATGCGTTCATCGTTTCAGCCTAAAAATAATAGTCTGTCCC 
TTTAGCCAGTTTTCATGTCTGCACAAGACCTTTCAATAGGCCTTTCAAATGATAATTCCTCC 
AGAAAACCAGTCTAAGGGTGAGGACCCCAACTCTAGCCTCCTCTTGTCTTGCTGTCCTCTGT 
TTCTCTCTTTCTGCTTTAAATTCAATAAAAGTGACACTGAGCAAAAAAAAAAAAAAA 
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M^ALRGASALLVLFLAAFLPPPQCTQDPAMVHYIYQRFRVLEQGLEKCTQATRAYIQEFQE 
FSKNI S VMLGRCQT YTSE YKSAVGNLALRVERAQRE I D Y I QYLREADE C I VS ED KTLAEMLL 
QEAEEEKKIRTLLNASCDNMLMGIKSLKIVKKM1DTHGSWMKDAWNSPKWLLIGSRNNTV 
WEFANIRAFMEDNTKPAPRKQILTLSWQGTGQVIYKGFLFFHNQATSNEIIKYNLQKRTVED 
RMLLPGGVGRALVYQHSPSTYIDLAVDEHGLWAIHSGPGTHSHLVLTKIEPGTLGVEHSWDT 
PCRSQDAEASFLLCGVLYWYSTGGQGPHRITCIYDPLGTISEEDLPNLFFPKRPRSHSMIH 
YNPRDKQL YAWNEGNQ I I YKLQTKRKLPLK 
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FIGURE 263 

GGGCGCCCGCGTACTCACTAGCTGAGGTGGCAGTGGTTCCACCAACATGGAGCTCTCGCAGA 
TGTCGGAGCTCATGGGGCTGTCGGTGTTGCTTGGGCTGCTGGCCCTGATGGCGACGGCGGCG 
GTAGCGCGGGGGTGGCTGCGCGCGGGGGAGGAGAGGAGCGGCCGGCCCGCCTGCCAAAAAGC 
AAATGGATTTCCACCTGACAAATCTTCGGGATCCAAGAAGCAGAAACAATATCAGCGGATTC 
GGAAGGAGAAGCCTCAACAACACAACTTCACCCACCGCCTCCTGGCTGCAGCTCTGAAGAGC 
CACAGCGGGAACATATCTTGCATGGACTTTAGCAGCAATGGCAAATAGCTGGCTACCTGTGC 
AGATGATCGCACCATCCGCATCTGGAGCACCAAGGACTTCCTGCAGCGAGAGCACCGCAGCA 
TGAGAGCCAACGTGGAGCTGGACCACGCCACCCTGGTGCGCTTCAGCCCTGACTGCAGAGCC 
TTCATCGTCTGGCTGGCCAACGGGGACACCCTCCGTGTCTTCAAGATGACCAAGCGGGAGGA 
TGGGGGCTACACCTTCACAGCCACCCCAGAGGACTTCCCTAAAAAGCACAAGGCGCCTGTCA 
TCGACATTGGCATTGCTAACACAGGGAAGTTTATCATGACTGCCTCCAGTGACACCACTGTC 
CTCATCTGGAGCCTGAAGGGTCAAGTGCTGTCTACCATCAACACCAACCAGATGAACAACAC 
ACACGCTGCTGTATCTCCCTGTGGCAGATTTGTAGCCTCGTGTGGCTTCACCCCAGATGTGA 
AGGTTTGGGAAGTCTGCTTTGGAAAGAAGGGGGAGTTCCAGGAGGTGGTGCGAGCCTTCGAA 
CTAAAGGGCCACTCCGCGGCTGTGCACTCGTTTGCTTTCTCCAACGACTCACGGAGGATGGC 
TTCTGTCTCCAAGGATGGTACATGGAAACTGTGGGACACAGATGTGGAATACAAGAAGAAGC 
AGGACCCCTACTTGCTGAAGACAGGCCGCTTTGAAGAGGCGGCGGGTGCCGCGCCGTGCCGC 
CTGGCCCTCTCCCCCAACGCCCAGGTCTTGGCCTTGGCCAGTGGCAGTAGTATTCATCTCTA 
CAATACCCGGCGGGGCGAGAAGGAGGAGTGCTTTGAGCGGGTCCATGGCGAGTGTATCGCCA 
ACTTGTCCTTTGACATCACTGGCCGCTTTCTGGCCTCCTGTGGGGACCGGGCGGTGCGGCTG 
TTTCACAACACTCCTGGCCACCGAGCCATGGTGGAGGAGATGCAGGGCCACCTGAAGCGGGC 
CTCCAACGAGAGCACCCGCCAGAGGCTGCAGCAGCAGCTGACCCAGGCCCAAGAGACCCTGA 
AGAGCCTGGGTGCCCTGAAGAAGTGACTCTGGGAGGGCCCGGCGCAGAGGATTGAGGAGGAG 
GGATCTGGCCTCCTCATGGCACTGCTGCCATCTTTCCTCCCAGGTGGAAGCCTTTCAGAAGG 
AGTCTGCTGGTTTTCTTACTGGTGGCCCTGCTTCTTCCCATTGAAACTACTCTTGTCTACTT 
AGGTCTCTCTCTTCTTGCTGGCTGTGACTCCTCCCTGACTAGTGGCCAAGGTGCTTTTCTTC 
CTCCCAGGCCCAGTGGGTGGAATCTGTCCCCACCTGGCACTGAGGAGAATGGTAGAGAGGAG 
AGGAGAGAGAGAGAGAATGTGATTTTTGGCCTTGTGGCAGCACATCCTCACACCCAAAGAAG 
TTTGTAAATGTTCCAGAACAACCTAGAGAACACCTGAGTACTAAGCAGCAGTTTTGCAAGGA 
TGGGAGACTGGGATAGCTTCCCATCACAGAACTGTGTTCCATCAAAAAGACACTAAGGGATT 
TCCTTCTGGGCCTCAGTTCTATTTGTAAGATGGAGAATAATCCTCTCTGTGAACTCCTTGCA 
AAGATGATATGAGGCTAAGAGAATATCAAGTCCCCAGGTCTGGAAGAAAAGTAGAAAAGAGT 
AGTACTATTGTCCAATGTCATGAAAGTGGTAAAAGTGGGAACCAGTGTGCTTTGAAACCAAA 
TTAGAAACACATTCCTTGGGAAGGCAAAGTTTTCTGGGACTTGATCATACATTTTATATGGT 
TGGGACTTCTCTCTTCGGGAGATGATATCTTGTTTAAGGAGACCTCTTTTCAGTTCATCAAG 
TTCATCAGATATTTGAGTGCCCACTCTGTGCCCAAATAAATATGAGCTGGGGATTAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 264 

ME L S QMS ELMGLS VLLGLLALMATAAVARG WLRAGE ERSGRP ACQKANGF P P DKS S GS KKQK 
QYQRIRKEKPQQHNFTHRLLAAALKSHSGNISCMDFSSNGKYLATCADDRTIRIWSTKDFLQ 
REHRSMRANVELDHATLVRFSPDCRAFIWLANGDTLRVFKMTKREDGGYTFTATPEDFPKK 
HKAPVIDIGIANTGKFIMTASSDTTVLIWSLKGQVLSTINTNQMNNTHAAVSPCGRFVASCG 
FTPDVKVWEVCFGKKGEFQEWRAFELKGHSAAVHSFAFSNDSRRMASVSKDGTWKLWDTDV 
EYKKKQDPYLLKTGRFEEAAGAAPCRLALSPNAQVLALASGSSIHLYNTRRGEKEECFERVH 
GE C I ANLS FD I TGRFLAS CGDRAVRLFHNTPGHRAMVEEMQGHLKRASNE STRQRLQQQLTQ 
AQETLKS LGALKK 
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FIGURE 265 

TGGCCTCCCCAGCTTGCCAGGCACAAGGCTGAGCGGGAGGAAGCGAGAGGCATCTAAGCAGG 
CAGTGTTTTGCCTTCACCCCAAGTGACCATGAGAGGTGCCACGCGAGTCTCAATCATGCTCC 
TCCTAGTAACTGTGTCTGACTGTGCTGTGATCACAGGGGCCTGTGAGCGGGATGTCCAGTGT 
GGGGCAGGCACCTGCTGTGCCATCAGCCTGTGGCTTCGAGGGCTGCGGATGTGCACCCCGCT 
GGGGCGGGAAGGCGAGGAGTGCCACCCCGGCAGCCACAAGGTCCCCTTCTTCAGGAAACGCA 
AGCACCACACCTGTCCTTGCTTGCCCAACCTGCTGTGCTCCAGGTTCCCGGACGGCAGGTAC 
CGCTGCTCCATGGACTTGAAGAACATCAATTTTTAGGCGCTTGCCTGGTCTCAGGATACCCA 
CCATCCTTTTCCTGAGCACAGCCTGGATTTTTATTTCTGCCATGAAACCCAGCTCCCATGAC 
TCTCCCAGTCCCTACACTGACTACCCTGATCTCTCTTGTCTAGTACGCACATATGCACACAG 
GCAGACATACCTCCCATCATGACATGGTCCCCAGGCTGGCCTGAGGATGTCACAGCTTGAGG 
CTGTGGTGTGAAAGGTGGCCAGCCTGGTTCTCTTCCCTGCTCAGGCTGCCAGAGAGGTGGTA 
T^ATGGCAGAAAGGACATTCCCCCTCCCCTCCCCAGGTGACCTGCTCTCTTTCCTGGGCCCTG 
CCCCTCTCCCCACATGTATCCCTCGGTCTGAATTAGACATTCCTGGGCACAGGCTCTTGGGT 
GCATTGCTCAGAGTCCCAGGTCCTGGCCTGACCCTCAGGCCCTTCACGTGAGGTCTGTGAGG 
ACCAATTTGTGGGTAGTTCATCTTCCCTCGATTGGTTAACTCCTTAGTTTCAGACCACAGAC 
TCAAGATTGGCTCTTCCCAGAGGGCAGCAGACAGTCACCCCAAGGCAGGTGTAGGGAGCCCA 
GGGAGGCCAATCAGCCCCCTGAAGACTCTGGTCCCAGTCAGCCTGTGGCTTGTGGCCTGTGA 
CCTGTGACCTTCTGCCAGAATTGTCATGCCTCTGAGGCCCCCTCTTACCACACTTTACCAGT 
TAACCACTGAAGCCCCCAATTCCCACAGCTTTTCCATTAAAATGCAAATGGTGGTGGTTCAA 
TCTAATCTGATATTGACATATTAGAAGGCAATTAGGGTGTTTCCTTAAACAACTCCTTTCCA 
AGGATCAGCCCTGAGAGCAGGTTGGTGACTTTGAGGAGGGCAGTCCTCTGTCCAGATTGGGG 
TGGGAGCAAGGGACAGGGAGCAGGGCAGGGGCTGAAAGGGGCACTGATTCAGACCAGGGAGG 
CAACTACACACCAACATGCTGGCTTTAGAATAAAAGCACCAACTGAAAAAA 
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FIGURE 266 

MRGATRVS IMLLLVTVSDCAVITGACERDVQCGAGTCCAI SLWLRGLRMCTPLGREGEECHP 
GSHKVPFFRKRKHHTCPCLPNLLCSRFPDGRYRCSMDLKNINF 
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FIGURE 267 

AGCGCCCGGGCGTCGGGGCGGTAAAAGGCCGGCAGAAGGGAGGCACTTGAGAAATGTCTTTC 
CTCCAGGACCCAAGTTTCTTCACCATGGGGATGTGGTCCATTGGTGCAGGAGCCCTGGGGGC 
TGCTGCCTTGGCATTGCTGCTTGCCAACACAGACGTGTTTCTGTCCAAGCCCCAGAAAGCGG 
CCCTGGAGTACCTGGAGGATATAGACCTGAAAACACTGGAGAAGGAACCAAGGACTTTCAAA 
GCAAAGGAGCTATGGGAAAAAAATGGAGCTGTGATTATGGCCGTGCGGAGGCCAGGCTGTTT 
CCTCTGTCGAGAGGAAGCTGCGGATCTGTCCTCCCTGAAAAGCATGTTGGACCAGCTGGGCG 
TCCCCCTCTATGCAGTGGTAAAGGAGCACATCAGGACTGAAGTGAAGGATTTCCAGCCTTAT 
TTCAAAGGAGAAATCTTCCTGGATGAAAAGAAAAAGTTCTATGGTCCACAAAGGCGGAAGAT 
GATGTTTATGGGATTTATCCGTCTGGGAGTGTGGTACAACTTCTTCCGAGCCTGGAACGGAG 
GCTTCTCTGGAAACCTGGAAGGAGAAGGCTTCATCCTTGGGGGAGTTTTCGTGGTGGGATCA 
GGAAAGCAGGGCATTCTTCTTGAGCACCGAGAAAAAGAATTTGGAGACAAAGTAAACCTACT 
TTCTGTTCTGGAAGCTGCTAAGATGATCAAACCACAGACTTTGGCCTCAGAGAAAAAATGAT 
TGTGTGAAACTGCCCAGCTCAGGGATAACCAGGGACATTCACCTGTGTTCATGGGATGTATT 
GTTTCCACTCGTGTCCCTAAGGAGTGAGAAACCCATTTATACTCTACTCTCAGTATGGATTA 
TTAATGTATTTTAATATTCTGTTTAGGCCCACTAAGGCAAAATAGCCCCAAAACAAGACTGA 
C^AAATCTGAAAAACTAATGAGGATTATTAAGCTAAAACCTGGGAAATAGGAGGCTTAAAA 
TTGACTGCCAGGCTGGGTGCAGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCAAGG 
TGAGCAAGTCACTTGAGGTCGGGAGTTCGAGACCAGCCTGAGCAACATGGCGAAACCCCGTC 
TCTACTAAAAATACAAAAATCACCCGGGTGTGGTGGCAGGCACCTGTAGTCCCAGCTACCCG 
GGAGGCTGAGGCAGGAGAATCACTTGAACCTGGGAGGTGGAGGTTGCGGTGAGCTGAGATCA 
CACCACTGTATTCCAGCCTGGGTGACTGAGACTCTAACTAA 
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FIGURE 268 

MSFLQDPSFFTMGMWSIGAGALGAAALALLLANTDVFLSKPQKAALEYLEDIDLKTLEKEPR 
TFKAKELWEKNGAVIMAVRRPGCFLCREEAADLSSLKSMLDQLGVPLYAWKEHIRTEVKDF 
QPYFKGEIFLDEKKKFYGPQRRKI^FMGFIRLGVWYNFFRAWNGGFSGNLEGEGFILGGVFV 
VGSGKQGILLEHREKEFGDKVNLLSVLEAAKMIKPQTLASEKK 
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FIGURE 269 

ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCG 
GGCCAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGA 
AGCCCCTTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTG 
CTTCTGGCGCTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACA 
GACCACTTCTGCAAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATG 
GCAACCTGCGTCCGGAAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTG 
CTCCTGGCTGTGGGGCTGGCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGG 
CACCTACCGGCCCAGTAGCGAGGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGG 
ACTCCAAGGAGACGGTGCAGGGCTGCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCC 
CTTCATTGCTGTGTGACCTTGGGGAAAGGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAG 
TGCTTAATAGCAGGGAAGAAGGTACTTCAAAGACTCTGCCCCTGAGGTCAAGAGAGGATGGG 
GCTATTCACTTTTATATATTTATATAAAATTAGTAGTGAGATGTAAAAAAAAAAAAAAAAAA 
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FIGURE 270 



MANPGLGLLLALGLPFLLARWGRAWGQIQTTSANENSTVLPSSTSSSSDGNLRPEAITAIIV 
VFSLLAALLIJVVGLALLWKLREKRQT^ 
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FIGURE 271 

AATATATCATCTATTTATCATTAATCAATAATGTATTCTTTTATTCCAATAACATTTGGGTT 
TTGGGATTTTAATTTTC^AACACAGCAGAATSACATTTTTTCTGTCACTATTATTATTGTTG 
GTATGTGAAGCTATTTGGAGATCCAATTCAGGAAGCAACACATTGGAGAATGGCTACTTTCT 
ATCAAGAAATAAAGAGAACCACAGTCAACCCACACAATCATCTTTAGAAGACAGTGTGACTC 
CTACCAAAGCTGTCAT^AACCACAGGCAAGGGCATAGTTAAAGGACGGAATCTTGACTCAAGA 
GGGTTAATTCTTGGTGCTGAAGCCTGGGGCAGGGGTGTAAAGAAAAACACTTAGATTCAATG 
ATTGTAAATTTAAGGCAAATACACATATTAGTATTACCTTAGTGTAATGTATCCCTGTCATA 
TATACAATAAGGTGAAATTATAAGTACCCTATGCAGTTGGCTGGACAGTTCTAAATTGGACT 
TTATTAATTTTTAAAATCAGTAACTGATTTATCACTGGCTATGTGCTTAGATCTACAGGAGA 
TCATATAATTTGATACAAATAAAAGAAAAGTGTTCTCTCCCCTTACAGAATTGACATTTTAA 
ATGCGATACAGTTAGAATAGGAAATATGACATTAGAAAGGAAGAATGACAGGGAGAAAGGAA 
AGAAGGGAAAATGTTGCCAAGGAAAAAAAAA 



WO 99/63088 ^ / PCT/US99/12252 

FIGURE 272 

MTFFLSLLLLLVCEAIWRSNSGSNTLENGYFLSRNKENHSQPTQSSLEDSVTPTKAVKTTGK 
G I VKGRNLDSRGL I LGAEAWGRGVKKNT 
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FIGURE 273 

GCCAGGAATAACTAGAGAGGAACAATGGGGTTATTCAGAGGTTTTGTTTTCCTCTTAGTTCT 
GTGCCTGCTGCACCAGTCAAATACTTCCTTCATTAAGCTGAATAATAATGGCTTTGAAGATA 
TTGTCATTGTTATAGATCCTAGTGTGCCAGAAGATGAAAAAATAATTGAACAAATAGAGGAT 
ATGGTGACTACAGCTTCTACGTACCTGTTTGAAGCCACAGAAAAAAGATTTTTTTTCAAAAA 
TGTATCTATATTAATTCCTGAGAATTGGAAGGAAAATCCTCAGTACAAAAGGCCAAAACATG 
AAAACCATAAACATGCTGATGTTATAGTTGCACCACCTACACTCCCAGGTAGAGATGAACCA 
TACACCAAGCAGTTCACAGAATGTGGAGAGAAAGGCGAATACATTCACTTCACCCCTGACCT 
TCTACTTGGAAAAAAACAAAATGAATATGGACCACCAGGCAAACTGTTTGTCCATGAGTGGG 
CTCACCTCCGGTGGGGAGTGTTTGATGAGTACAATGAAGATCAGCCTTTCTACCGTGCTAAG 
TCAAAAAAAATCGAAGCAACAAGGTGTTCCGCAGGTATCTCTGGTAGAAATAGAGTTTATAA 
GTGTCAAGGAGGCAGCTGTCTTAGTAGAGCATGCAGAATTGATTCTACAACAAAACTGTATG 
GAAAAGATTGTCAATTCTTTCCTGATAAAGTACAAACAGAAAAAGCATCCATAATGTTTATG 
CT^GTATTGATTCTGTTGTTGAATTTTGTAACGAAAAAACCCATAATCAAGAAGCTCCAAG 
CCTACAAAACATAAAGTGCAATTTTAGAAGTACATGGGAGGTGATTAGCAATTCTGAGGATT 
TTAAT^AACACCATACCCATGGTGACACCACCTCCTCCACCTGTCTTCTCATTGCTGAAGATC 
AGTCAAAGAATTGTGTGCTTAGTTCTTGATAAGTCTGGAAGCATGGGGGGTAAGGACCGCCT 
AAATCGAATGAATCAAGCAGCAAAACATTTCCTGCTGCAGACTGTTGAAAATGGATCCTGGG 
TGGGGATGGTTCACTTTGATAGTACTGCCACTATTGTAAATAAGCTAATCCAAATAAAAAGC 
AGTGATGAAAGAAACACACTCATGGCAGGATTACCTACATATCCTCTGGGAGGAACTTCCAT 
CTGCTCTGGAATTAAATATGCATTTCAGGTGATTGGAGAGCTACATTCCCAACTCGATGGAT 
CCGAAGTACTGCTGCTGACTGATGGGGAGGATAACACTGCAAGTTCTTGTATTGATGAAGTG 
AAACAAAGTGGGGCCATTGTTCATTTTATTGCTTTGGGAAGAGCTGCTGATGAAGCAGTAAT 
AGAGATGAGCAAGATAACAGGAGGAAGTCATTTTTATGTTTCAGATGAAGCTCAGAACAATG 
GCCTCATTGATGCTTTTGGGGCTCTTACATCAGGAAATACTGATCTCTCCCAGAAGTCCCTT 
C AGCTCGAAAGTAAGGGATTAACACTGAATAGTAATG C CTGG AT GAACGACACTGT CAT AAT 
TGATAGTACAGTGGGAAAGGACACGTTCTTTCTCATCACATGGAACAGTCTGCCTCCCAGTA 
TTTCTCTCTGGGATCCCAGTGGAACAATAATGGAAAATTTCACAGTGGATGCAACTTCCAAA 
ATGGCCTATCTCAGTATTCCAGGAACTGCAAAGGTGGGCACTTGGGCATACAATCTTCAAGC 
CAAAGCGAACCCAGAAACATTAACTATTACAGTAACTTCTCGAGCAGCAAATTCTTCTGTGC 
CTCCAATCACAGTGAATGCTAAAATGAATAAGGACGTAAACAGTTTCCCCAGCCCAATGATT 
GTTTACGCAGAAATTCTACAAGGATATGTACCTGTTCTTGGAGCCAATGTGACTGCTTTCAT 
TGAATCACAGAATGGACATACAGAAGTTTTGGAACTTTTGGATAATGGTGCAGGCGCTGATT 
CTTTCAAGAATGATGGAGTCTACTCCAGGTATTTTACAGCATATACAGAAAATGGCAGATAT 
AGCTTAAAAGTTCGGGCTCATGGAGGAGCAAACACTGCCAGGCTAAAATTACGGCCTCCACT 
GAATAGAGCCGCGTACATACCAGGCTGGGTAGTGAACGGGGAAATTGAAGCAAACCCGCCAA 
GACCTGAAATTGATGAGGATACTCAGACCACCTTGGAGGATTTCAGCCGAACAGCATCCGGA 
GGTGCATTTGTGGTATCACAAGTCCCAAGCCTTCCCTTGCCTGACCAATACCCACCAAGTCA 
AATCACAGACCTTGATGCCACAGTTCATGAGGATAAGATTATTCTTACATGGACAGCACCAG 
GAGATAATTTTGATGTTGGAAAAGTTCAACGTTATATCATAAGAATAAGTGCAAGTATTCTT 
GATCTAAGAGACAGTTTTGATGATGCTCTTCAAGTAAATACTACTGATCTGTCACCAAAGGA 
GGCCAACTCCAAGGAAAGCTTTGCATTTAAACCAGAAAATATCTCAGAAGAAAATGCAACCC 
ACATATTTATTGCCATTAAAAGTATAGATAAAAGCAATTTGACATCAAAAGTATCCAACATT 
GCACAAGTAACTTTGTTTATCCCTCAAGCAAATCCTGATGACATTGATCCTACACCTACTCC 
TACTCCTACTCCTACTCCTGATAAAAGTCATAATTCTGGAGTTAATATTTCTACGCTGGTAT 
TGTCTGTGATTGGGTCTGTTGTAATTGTTAACTTTATTTTAAGTACCACCATT TGA ACCTTA 
ACGAAGAAAAAAATCTTCAAGTAGACCTAGAAGAGAGTTTTAAAAAACAAAACAATGTAAGT 
AAAGGATATTTCTGAATCTTAAAATTCATCCCATGTGTGATCATAAACTCATAAAAATAATT 
TTAAGATGTCGGAAAAGGATACTTTGATTAAATAAAAACACTCATGGATATGTAAAAACTGT 
CAAGATTAT^AATTTAATAGTTTCATTTATTTGTTATTTTATTTGTAAGAAATAGTGATGAAC 
AAAGATCCTTTTTCATACTGATACCTGGTTGTATATTATTTGATGCAACAGTTTTCTGAAAT 
GATATTTCAAATTGCATCAAGAAATTAAAATCATCTATCTGAGTAGTCAAAATACAAGTAAA 
GGAGAGCAAATAAACAACATTTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 274 

MGLFRGFVFLLVLCLLHQSNTSFIKLNNN^ 

LFEATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTEC 

GEKGEYIHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATR 

CSAGISGRNRVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKASIMFMQSIDSWE 

FCNEKTHNQEAPSLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLV 

LDKSGSMGGKDRLNRMNQAAKHFLLQTVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLM 

AGLPTYPLGGTSICSGIKYAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVH 

FIALGRAADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLT 

LNSNAWMNDTVIIDSTVGKDTFFLITWNSLPPSISLWDPSGTIMENFTVDATSKMAYL 

TAKVGT WA YNLQ AKANPETLT I TVTSRAANS S VP P I T VNAKMNKD VNS F P S P M I VY AE I LQG 

YVPVLGANVTAFIESQNGHTEVLELLDNGAGADSFKNDGVYSRYFTAYTENGRYSLKVRAHG 

GANTARLKLRPPLNRAAYIPGWWNGEIEANPPRPEIDEDTQTTLEDFSRTASGGAFWSQV 

PSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDNFDVGKVQRYIIRISASILDLRDSFDD 

ALQVNTTDLSPKEANSKESFAFKPENISEENATHIFIAIKSIDKSNLTSKVSNIAQVTLFIP 

QANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWIVNFILSTTI 
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FIGURE 275 



CTCCTTAGGTGGAAACCCTGGGAGTAGAGTACTGACAGCAAAGACCGGGAAAGACCATACGTCCCCGG 
GCAGGGGTGACAACAGGTGTCATCTTTTTGATCTCGTGTGTGGCTGCCTTCCTATTTCAAGGAAAGAC 
GCCAAGGTAATTTTGACCCAGAGGAGCAATGATGTAGCCACCTCCTAACCTTCCCTTCTTGAACCCCC 
AGTTATGCCAGGATTTACTAGAGAGTGTCAACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGG 
TTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCAGTGCTCAGAAGTGACTTGCCTGAGGGTG 
GACCAGAAGAAAGGAAAGGTCCCCTCTTGCTGTTGGCTGCACATCAGGAAGGCTGTGATGGGAATGAA 
GGTGAAAACTTGGAGATTTCACTTCAGTGATTGCTTCTGCCTGGAAG 

AGCTGCTCTGTGTGGTGGTTAACTCCAAGAGGCAGAACTCGTTCTAGAAGGAAATGGATGCAAGCAGC 
TCCGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCT 
TTGAGGGATGCCACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTT CG CCGGGGG CTGCT 
TGCGTGGATTTCCCGGGTGGTGGTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGT 
TGGCCTGCACCCCAAAAGGTGACGAGGAGCAGCTGGCACTGCCCAGGGCCAACAGCCCCACGGGGAAG 
GAGGGGTACCAGGCCGTCCTTCAGGAGTGGGAGGAGCAGCAC CG CAACTACGTGAG CAG C CTGAAGCG 
GCAGATCGCACAGCTCAAGGAGGAGCTGCAGGAGAGGAGTGAGCAGCTCAGGAATGGGCAGTACCAAG 
CCAGCGATGCTGCTGGCCTGGGTCTGGACAGGAGCCCCCCAGAGAAAACCCAGGCCGACCTCCTGGCC 
TTCCTGCACTCGCAGGTGGACAAGGCAGAGGTGAATGCTGGCGTCAAGCTGGCCACAGAGTATGCAGC 
AGTGCCTTTCGATAGCTTTACTCTACAGAAGGTGTACCAGCTGGAGACTGGCCTTACCCGCCACCCCG 
AGGAGAAGC CTGTGAGGAAGGACAAGCGGGATGAGTTGGTGGAAGC CATTGAATCAGCCTTGGAGACC 
CTGAACAATCCTGCAGAGAACAGCCCCAATCACCGTCCTTACACGGCCTCTGATTTCATAGAAGGGAT 
CTACCGAACAGAAAGGGACAAAGGGACATTGTATGAGCTCACCTTCAAAGGGGACCACAAACACGAAT 
TCAAACGGCTCATCTTATTTCGACCATTCAGCCCCATCATGAAAGTGAAAAATGAAAAGCTCAACATG 
GCCAACACGCTTATCAATGTTATCGTGCCTCTAGCAAAAAGGGTGGACAAGTTCCGGCAGTTCATGCA 
GAATTTC AGGGAGATGTGCATTGAGCAGGATGGGAGAGT CCAT CT CACTGTTGTTTACTTTGGGAAAG 
AAGAAATAAATGAAGTCAAAGGAAT ACTTGAAAACACTTC CAAAG CTG C CAACTTCAGGAACTTTACC 
TTCATCCAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGG 
AAGCAACGTCCTTCTCTTTTTCTGTGATGTGGACATCTACTTCACATCTGAATTCCTCAATACGTGTA 
GGCTGAATACACAGCCAGGGAAGAAGGTATTTTATCCAGTTCTTTTCAGTCAGTACAATCCTGGCATA 
AT AT ACGG CC AC C ATGATGCAGTCCCTCCCTTGGAACAGCAG CTGGT CATAAAGAAGGAAACTGGATT 
TTGGAGAGACTTTGGATTTGGGATGACGTGTCAGTATCGGTCAGACTTCATCAATATAGGTGGGTTTG 
ATCTGGACATCAAAGGCTGGGGCGGAGAGGATGTGCACCTTTATCGCAAGTATCTCCACAGCAACCTC 
ATAGTGGTACGGACGCCTGTGCGAGGACTCTTCCACCTCTGGCATGAGAAGCGCTGCATGGACGAGCT 
GACCCCCGAGCAGTACAAGATGTGCATGCAGTCCAAGGCCATGAACGAGGCATCCCACGGCCAGCTGG 
GCATGCTGGTGTTCAGGCACGAGATAGAGG CT CAC CTT CG C AAACAGAAAC AGAAGACAAGTAGCAAA 
AAAACATGAACTCCCAGAGAAGGATTGTGGGAGACACTTTTTCTTTCCTTTTGCAATTACTGAAAGTG 
GCTGCAACAGAGAAAAGACTTCCATAAAGGACGACAAAAGAATTGGACTGATGGGTCAGAGATGAGAA 
AGCCTCCGATTTCTCTCTGTTGGGCTTTTTACAACAGAAATCAT^AATCTCCGCTTTGCCTGCAAAAGT 
AACCCAGTTGCACCCTGTGAAGTGTCTGACAAAGGCAGAATG CTTGTG AG ATTATAAG CCTAATGGTG 
TGGAGGTTTTGATGGTGTTTACAATACACTGAGACCTGTTGTTTTGTGTGCTCATTGAAATATTCATG 
ATTTAAGAGCAGTTTTGTAAAAAATTCATTAGCATGAAAGGCAAGCATATTTCTCCTCATATGAATGA 
GCCTATCAGCAGGGCTCTAGTTTCTAGGAATGCTAAAATATCAGAAGGCAGGAGAGGAGATAGGCTTA 
TT ATGATACTAGTGAGTACATTAAGTAAAATAAAATGGAC CAGAAAAGAAAAGAAACC ATAAAT AT CG 
TGTCATATTTTCCCCAAGATTAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCT 
CCGTTTTTTTCTTTTATTTAAAAATGCACTTTTTTTCCCTTGTGAGTTATAGTCTGCTTATTTAATTA 
CCACTTTGCAAGCCTTACAAGAGAGCACAAGTTGGCCTACATTTTTATATTTTTTAAGAAGATACTTT 
GAGATGCATT ATGAGAACTTTCAGTTCAAAG CATCAAATTGATGC CATATC CAAGGACATGC CAAATG 
CTGATTCTGTCAGGCACTGAATGTCAGGCATTGAGACATAGGGAAGGAATGGTTTGTACTAATACAGA 
CGTACAGATACTTTCTCTGAAGAGTATTTTCGAAGAGGAGCAACTGAACACTGGAGGAAAAGAAAATG 
ACACTTTCTGCTTTACAGAAAAGGAAACTCATTCAGACTGGTGATATCGTGATGTACCTAAAAGTCAG 
AAACCACATTTTCTCCTCAGAAGTAGGGACCGCTTTCTTACCTGTTTAAATAAACCAAAGTATACCGT 
GTGAACCAAACAATCTCTTTTCAAAACAGGGTGCTCCTCCTGGCTTCTGGCTTCCATAAGAAGAAATG 
GAGAAAAATAT ATATATATATATATATATTGTGAAAGATCAAT CC AT CTGC CAGAAT CT AGTGGGATG 
GAAGTTTTTGCTACATGTTATCCACCCCAGGCCAGGTGGAAGTAACTGAATTATTTTTTAAATTAAGC 
AGTTCTACTCAATCACCAAGATGCTTCTGAAAATTGCATTTTATTACCATTTCAAACTATTTTTTAAA 
AATAAATACAGTTAACAT AGAGTGGTTT CTTCATT CATGTGAAAATT ATTAGCCAGCACCAGATG CAT 
GAGCTAATTATCTCTTTGAGTCCTTGCTTCTGTTTGCTCACAGTAAACTCATTGTTTAAAAGCTTCAA 
GAACATTCAAGCTGTTGGTGTGTTAAAAAATGCATTGTATTGATTTGTACTGGTAGTTTATGAAATTT 
AATTAAAACACAGGCC^TGAATGGAAGGTGGTATTGCACAGCTAATAAAATATGATTTGTGGATATGAA 
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FIGURE 276 



PCI7US99/12252 



MMMVRRGLLAWISRVVVLLVLLCCAISVLYM^ 

EWEEQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQT^DLLAFL 
HSQVDKAEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIES 
ALETLNNPAENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPI 
MKVKNEKLNMANTLINVIVPLAKRVD^ 

GILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCR 
LNTQPGKKVFYPVLFSQYNPGI I YGHHDAVPPLEQQLVI KKETGFWRDFGFGMTCQYRSDFI 
NIGGFDLDIKGWGGEDVHLYRKYLHSNLIVVRTPVRGLFHLWHEKRCMDELTPEQYKMCMQS 
KAMNEASHGQLGMLVFRHE I E AHLRKQKQKTS S KKT 
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FIGURE 277 

GAAAGAATGTTGTGGCTGCTCTTTTTTCTGGTGACTGCCATTCATGCTGAACTCTGTCAACC 
AGGTGCAGAAAATGCTTTTAAAGTGAGACTTAGTATCAGAACAGCTCTGGGAGATAAAGCAT 
ATGCCTGGGATACCAATGAAGAATACCTCTTCAAAGCGATGGTAGCTTTCTCCATGAGAAAA 
GTTCCCAACAGAGAAGCAACAGAAATTTCCCATGTCCTACTTTGCAATGTAACCCAGAGGGT 
ATCATTCTGGTTTGTGGTTACAGACCCTTCAAAAAATCACACCCTTCCTGCTGTTGAGGTGC 
AATCAGCCATAAGAATGAACAAGAACCGGATCAACAATGCCTTCTTTCTAAATGACC^AACT 
CTGGAATTTTTAAAAATCCCTTCCACACTTGCACCACCCATGGACCCATCTGTGCCCATCTG 
GATTATTATATTTGGTGTGATATTTTGCATCATCATAGTTGCAATTGCACTACTGATTTTAT 
CAGGGATCTGGCAACGTAGAAGAAAGAACAAAGAACCATCTGAAGTGGATGACGCTGAAGAT 
AAGTGTGAAAACATGATCACAATTGAAAATGGCATCCCCTCTGATCCCCTGGACATGAAGGG 
GGGCATATTAATGATGCCTTCATGACAGAGGATGAGAGGCTCACCCCTCTCTGAAGGGCTGT 
TGTTCTGCTTCCTCAAGAAATTAAACATTTGTTTCTGTGTGACTGCTGAGCATCCTGAAATA 
CCAAGAGCAGATCATATATTTTGTTTCACCATTCTTCTTTTGTAATAAATTTTGAATGTGCT 
TGAAAGTGAAAAGCAATCAATTATACCCACCAACACCACTGAAATCATAAGCTATTCACGAC 
TCAAAATATTCTAAAATATTTTTCTGACAGTATAGTGTATAAATGTGGTCATGTGGTATTTG 
TAGTTATTGATTTAAGCATTTTTAGAAATAAGATCAGGCATATGTATATATTTTCACACTTC 
AAAGACCTAAGGAAAAATAAATTTTCCAGTGGAGAATACATATAATATGGTGTAGAAATCAT 
TGAAAATGGATCCTTTTTGACGATCACTTATATCACTCTGTATATGACTAAGTAAACAAAAG 
TGAGAAGTAATTATTGTAAATGGATGGATAAAAATGGAATTACTCATATACAGGGTGGAATT 
TTATCCTGTTATCACACCAACAGTTGATTATATATTTTCTGAATATCAGCCCCTAATAGGAC 
AATTCTATTTGTTGACCATTTCTACAATTTGTAAAAGTCCAATCTGTGCTAACTTAATAAAG 
TAATAATCATCTCTTTTTAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 278 

MLVttLFFLVTAIHAELCQPGAENAFKTO 

NREATE I SHVLLCNVTQRVS FWFWTD P S KNHTL P AVE VQ S A I RMNKNR I NNAF FLND QTLE 
FLKIPSTLAPPMDPSVPIWIIIFGVIFCIIIVAIALLILSGIWQRRRKNKEPSEVDDAEDKC 
ENMITIENGIPSDPLDMKGGILMMPS 
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FIGURE 279 

AACTCAAACTCCTCTCTCTGGGAAAACGCGGTGCTTGCTCCTCCCGGAGTGGCCTTGGCAGG 
GTGTTGGAGCCCTCGGTCTGCCCCGTCCGGTCTCTGGGGCCAAGGCTGGGTTTCCCTCATGT 
ATGGCAAGAGCTCTACTCGTGCGGTGCTTCTTCTCCTTGGCATACAGCTCACAGCTCTTTGG 
CCTATAGCAGCTGTGGAAATTT AT AC CT CC CGGGTG CTGG AGGCTGTT AATGGGAC AG ATGC 
TCGGTTATy^ATGCACTTTCTCCAGCTTTGCCCCTGTGGGTGATGCTCTAACAGTGACCTGGA 
ATTTTCGTCCTCTAGACGGGGGACCTGAGCAGTTTGTATTCTACTACCACATAGATCCCTTC 
CAACCCATGAGTGGGCGGTTTAAGGACCGGGTGTCTTGGGATGGGAATCCTGAGCGGTACGA 
TGCCTCCATCCTTCTCTGGAAACTGCAGTTCGACGACAATGGGACATACACCTGCCAGGTGA 
AGAACCCACCTGATGTTGATGGGGTGATAGGGGAGATCCGGCTCAGCGTCGTGCACACTGTA 
CGCTTCTCTGAGATCCACTTCCTGGCTCTGGCCATTGGCTCTGCCTGTGCACTGATGATCAT 
AATAGTAATTGTAGTGGTCCTCTTCCAGCATTACCGGAAAAAGCGATGGGCCGAAAGAGCTC 
ATAAAGTGGTGGAGATAAAATCAAAAGAAGAGGAAAGGCTCAACCAAGAGAAAAAGGTCTCT 
GTTTATTTAGAAGACACAGACTAACAATTTTAGATGGAAGCTGAGATGATTTCCAAGAACAA 
GAACCCTAGTATTTCTTGAAGTTAATGGAAACTTTTCTTTGGCTTTTCCAGTTGTGACCCGT 
TTTCCAACCAGTTCTGCAGCATATTAGATTCTAGACAAGCAACACCCCTCTGGAGCCAGCAC 
AGTGCTCCTCCATATCACCAGTCATACACAGCCTCATTATTAAGGTCTTATTTAATTTCAGA 
GTGTAAATTTTTTC AAGTGCTC ATT AGGTTTT AT AAAC AAG AAG CT AC AT TT TTGC CCTTAA 
GACACTACTTACAGTGTTATGACTTGTATACACATATATTGGTATCAAAGGGGATAAAAGCC 
AATTTGTCTGTTACATTTCCTTTCACGTATTTCTTTTAGCAGCACTTCTGCTACTAAAGTTA 
ATGTGTTTACTCTCTTTCCTTCCCACATTCTCAATTAAAAGGTGAGCTAAGCCTCCTCGGTG 
TTTCTGATTAACAGTAAATCCTAAATTCAAACTGTTAAATGACATTTTTATTTTTATGTCTC 
TCCTTAACTATGAGACACATCTTGTTTTACTGAATTTCTTTCAATATTCCAGGTGATAGATT 
TTTGTCG 
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FIGURE 280 

MYGKSSTRAVLLLLGIQLTALWPIAAVEIYTSRVLEAVNGTDARLKCTFSSFAPVGDALTVT 
WNFRPLDGGPEQFVFYYHIDPFQPMSGRFKDRVSWDGNPERYDASILLWKLQFDDNGTYTCQ 
VKNPPDVDG VI GE I RLS WHTVRFS E IHFLALA I GS ACALM 1 1 IVIVWLFQHYRKKRWAER 
AHKWE IKS KEEERLNQEKKVS VYLEDTD 
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FIGURE 281 

GCATTTTTGTCTGTGCTCCCTGATCTTCAGGTCACCACCATGAAGTTCTTAGCAGTCCTGGT 
ACTCTTGGGAGTTTCCATCTTTCTGGTCTCTGCCCAGAATCCGACAACAGCTGCTCCAGCTG 
ACACGTATCCAGCTACTGGTCCTGCTGATGATGAAGCCCCTGATGCTGAAACCACTGCTGCT 
GCAACCACTGCGACCACTGCTGCTCCTACCACTGCAACCACCGCTGCTTCTACCACTGCTCG 
TAAAGACATTCCAGTTTTACCCAAATGGGTTGGGGATCTCCCGAATGGTAGAGTGTGTCCCT 
GAGATGGAATCAGCTTGAGTCTTCTGCAATTGGTCACAACTATTCATGCTTCCTGTGATTTC 
ATCCAACTACTTACCTTGCCTACGATATCCCCTTTATCTCTAATCAGTTTATTTTCTTTCAA 
ATAAAAAATAACTATGAGCAACATAAAAAAAAAAAAA 
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FIGURE 282 

MKFLAVLVLLGVS I FLVSAQNPTTAAPADTYPATGPADDEAPDAETTAAATTATTAAPTTAT 
TAASTTARKD I PVLP KWVGDLPNGRVCP 
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FIGURE 283 



GGACTCTGAAGGTCCCAAGCAGCTGCTGAGGCCCCCAAGGAAGTGGTTCCAACCTTGGACCC 
CTAGGGGTCTGGATTTGCTGGTTAACAAGATAACCTGAGGGCAGGACCCCATAGGGGAAT^C 
TACCTCCTGCCCTTCCACCTGCCCTGGTGTTCACGGTGGCCTGGTCCCTCCTTGCCGAGAGA 
GTGTCCTGGGTCAGGGACGCAGAGGACGCTCACAGACTCCAGCCCTTTGTTACCGAGAGGAC 
ACTTGGCAAGGTCCAGCGATGGTCCGGAGTCCACACACAGACTGGCGGCAGGGCAGGAGGGG 
GACAGTTCTGTTGTGCTTGGTTGGACAGTAAGAGGGTCTTGGCCAGTCCAGGGTGGGGGGCG 
GCAAACTCCATAAAGAACCAGAGGGTCTGGGCCCCGGCCACAGAGTCATCTGCCCAGCTCCT 
CTGCTGCTGGCCAGTGGGAGTGGCACGAGGTGGGGCTTTGTGCCAGTAAAACCACAGGCTGG 
ATTTGCCTGCGGGCCATGGTCCCTGTCTAGGGCAGCAATTCTCAACCTTCTTGCTCTCAGGA 
CCCCAAAGAGCTTTCATTGTATCTATTGATTTTTACCACATTAGCAATTAAAACTGAGA/^AT 
GGGCCGGGCACGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGAT 
CACCTGAGATCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCTTGTCTACTAAAAA 
TACAAAAAATTAGCCAGGCACAGTGGTGTGCACTGGTAGTCCCAGTTACTCGGGAGGCTGAG 
GCAGGAAAATCGCTTGAACCCAGGAGGCGGACGTTGCGGTGAGCCGAGATCGCGCCGCTGAT 
TCCAGCCTGGGCGACAAGAGTGAGACTCCATCTCACACA 
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FIGURE 284 

MLPPALPPALVFTVAWSLLAERVSWVRDAEDAHRLQPFVTERTLGKVQRWSGVHTQTGGRAG 
GGQFCCAWLDSKRVLASPGWGAANSIKNQRVWAPATESSAQLLCCWPVGVARGGALCQ 
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FIGURE 285 

GTCATOCCAGTGCCTGCTCTGTGCCTGCTCTGGGCCCTGGCAATGGTGACCCGGCCTGCCTCA 
GCGGCCCCCATGGGCGGCCCAGAACTGGCACAGCATGAGGAGCTGACCCTGCTCTTCCATGG 
GACCCTGCAGCTGGGCCAGGCCCTCAACGGTGTGTACAGGACCACGGAGGGACGGCTGACAA 
AGGCCAGGAACAGCCTGGGTCTCTATGGCCGCACAATAGAACTCCTGGGGCAGGAGGTCAGC 
CGGGGCCGGGATGCAGCCCAGGAACTTCGGGCAAGCCTGTTGGAGACTCAGATGGAGGAGGA 
TATTCTGCAGCTGCAGGCAGAGGCCACAGCTGAGGTGCTGGGGGAGGTGGCCCAGGCACAGA 
AGGTGCTACGGGACAGCGTGCAGCGGCTAGAAGTCCAGCTGAGGAGCGCCTGGCTGGGCCCT 
GCCTACCGAGAATTTGAGGTCTTAAAGGCTCACGCTGACAAGCAGAGCCACATCCTATGGGC 
CCTCACAGGCCACGTGCAGCGGCAGAGGCGGGAGATGGTGGCACAGCAGCATCGGCTGCGAC 
AGATCCAGGAGAGACTCCACACAGCGGCGCTCCCAGCC2GAATCTGCCTGGATGGAACTGAG 
GACCAATCATGCTGCAAGGAACACTTCCACGCCCCGTGAGGCCCCTGTGCAGGGAGGAGCTG 
CCTGTTCACTGGGATCAGCCAGGGCGCCGGGCCCCACTTCTGAGCACAGAGCAGAGACAGAC 
GCAGGCGGGGACAAAGGCAGAGGATGTAGCCCCATTGGGGAGGGGTGGAGGAAGGACATGTA 
CCCTTTCATGCCTACACACCCCTCATTAAAGCAGAGTCGTGGCATTTCAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 286 

MPVPALCLLWALAMVTR^ 

ARNSLGLYGRTIELLGQEVSRGRDAAQELRASLLETQMEEDILQLQAEATAEVLGEVAQAQK 
VLRDSVQRLEVQLRSAWLGPAYREFEVLKAHADKQSHILWALTGHVQRQRREMVAQQHRLRQ 
IQERLHTAALPA 
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FIGURE 287 

GG CAACATGGCTCAGCAGGCTTGCCCCAGAGCCATGGCAAAGAATGGACTTGTAATTTGCAT 
CCTGGTGATCACCTTACTCCTGGACCAGACCACCAGCCACACATCCAGATTAAAAGCCAGGA 
AGCACAGCAAACGTCGAGTGAGAGACAAGGATGGAGATCTGAAGACTCAAATTGAAAAGCTC 
TGGACAGAAGTCAATGCCTTGAAGGAAATTCAAGCCCTGCAGACAGTCTGTCTCCGAGGCAC 
TAAAGTTCACAAGAAATGCTACCTTGCTTCAGAAGGTTTGAAGCATTTCCATGAGGCCAATG 
AAGACTGCATTTCCAAAGGAGGAATCCTGGTTATCCCCAGGAACTCCGACGAAATCAACGCC 
CTCCAAGACTATGGTAAAAGGAGCCTGCCAGGTGTCAATGACTTTTGGCTGGGCATCAATGA 
CATGGTCACGGAAGGCAAGTTTGTTGACGTCAACGGAATCGCTATCTCCTTCCTCAACTGGG 
ACCGTGCACAGCCTAACGGTGGCAAGCGAGAAAACTGTGTCCTGTTCTCCCAATCAGCTCAG 
GGCAAGTGGAGTGATGAGGCCTGTCGCAGCAGCAAGAGATACATATGCGAGTTCACCATCCC 
TAAATAGGTCTTTCTCCAATGTGTCCTCCAAGCAAGATTCATCATAACTTATAGGTTCATGA 
TCTCTAAGATCAAGTAAAAATCATAATTTTTACTTATTAAAAAATTGCAACACAAGATCAAT 
GTCCATAGCAATATGATAGCATCAGCCAATTTTGCTAACACATTTCTTTGGGATTTTGCCCT 
TCCTGGGGTATAGGGGATCAGAAATATTGATCCATGTGCACGCAGATAAAATGGCTTCTGCT 
AAACAGACTAAAATCTTTCTCTCTAGTCTTTCTCACTTGTACAAACCCAGTTTGTTTTCAAA 
AAATCACAGTAGCAATGCAACTCATCACTCTAGAAAAGCAAGCTTAGGCTACCTGAAAGATT 
TTCCCTTGGAAGTTTAGCGTATGTTTGACTAACAAAAATTCCCTACATCAGAGACTCTAGGT 
GCTATATAATCCAAAAACTTTTCAGCCTGTTGCTCATTCTGTCCCATGCTGGCAATAATACC 
TTGTCAGCCCATTACCCTTATTTTGAATTGCTCCATCTCCTGGTGGGACTTGTATCTTGTCT 
GCCATATCAGAACACAAACCCCTGAAGAGGTTCTGATTTGATTTTTTTTTTTTCTTCATGCC 
TACCCTTTTTTTGGAAGTTTCCAGCCGCAATTTGAAATGAAATGACAAGGTGTATATTTGAT 
CAATTTTCATTCCCACCATTGCATTACAACCTCTAACTTAAATGGGTAACCCTAAGGCATAT 
CAAAGAAGCAGATTGCATGATAAACGGAAATAGAAAAAAAGAACCTACATTTATTTTGCTTT 
AGCATCCTTACTCTCACCTTTTATGAGATTGAGAGTGGACTTACATTTCCTTTTTTACATTT 
TCGTATATTTATTTTTTTTAGCCATCATTATATGTTTAAGTCTATTATGGGCAACCAATCTT 
TGGAAGCTGAAAACTGAATTTAAAGAATGCTATCTTGGAAAATTGCATACGTCTGTGCAATT 
TTTTATTCTGCCTAGTGCTATTCTGCTTGTTTAACTAGATTGTACAAAATAACTTCATTGCT 
TAATATCAAATTACAAAGTTTAGACTTGGAGGGAAATGGGCTTTTTAGAAGCAAACAATTTT 
AAATATATTTTGTTCTTCAAATAAATAGTGTTTAAACATTGAATGTGTTTTGTGAACAATAT 
CCCACTTTGCAAACTTTAACTACACATGCTTGGAATTAAGTTTTAGCTGTTTTCATTGCTCA 
ATAATAAAGCCTGAATTCTGATCAATAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 288 

MAQQACPRAMAKNGLVICILVITLLLDQTTSOT 

EVNALKE I QALQTVCLRGTKVHKKC YLASEGLKHFHEANEDC I S KGG I LV I PRNSDE I NALQ 
DYGKRSLPGVNDFWLG INDMVTEGKFVDVNGI AI SFLNWDRAQPNGGKRENCVLFSQSAQGK 
WSDEACRSSKRYICEFTIPK 
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FIGURE 289 

GCGAGGACCGGGTATAAGAAGCCTCGTGGCCTTGCCCGGGCAGCCGCAGGTTCCCCGCGCGC 
CCCGAGCCCCCGCGCCATGAAGCTCGCCGCCCTCCTGGGGCTCTGCGTGGCCCTGTCCTGCA 
GCTCCGCTGCTGCTTTCTTAGTGGGCTCGGCCAAGCCTGTGGCCCAGCCTGTCGCTGCGCTG 
GAGTCGGCGGCGGAGGCCGGGGCCGGGACCCTGGCCAACCCCCTCGGCACCCTCAACCCGCT 
GAAGCTCCTGCTGAGCAGCCTGGGCATCCCCGTGAACCACCTCATAGAGGGCTCCCAGAAGT 
GTGTGGCTGAGCTGGGTCCCCAGGCCGTGGGGGCCGTGAAGGCCCTGAAGGCCCTGCTGGGG 
GCCCTGACAGTGTTTGGCTGAGCCGAGACTGGAGCATCTACACCTGAGGACAAGACGCTGCC 
CACCCGCGAGGGCTGAAAACCCCGCCGCGGGGAGGACCGTCCATCCCCTTCCCCCGGCCCCT 
CTC^TAAACGTGGTTAAGAGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAA 
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FIGURE 290 

MKLAMjLGLCVALSCSSAAAFLVGSA^ 

SLG I PVNHL I EGSQKCVAELGPQAVGAVKALKALLGALTVFG 
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TGAAGGACTTTTCCAGGACCCAAGGCCACACACTGGAAGTCTTGCAGCTGAAGGGAGGCACT 
CCTTGGCCTCCGCAGCCGATCACATGAAGGTGGTGCCAAGTCTCCTGCTCTCCGTCCTCCTG 
GCACAGGTGTGGCTGGTACCCGGCTTGGCCCCCAGTCCTCAGTCGCCAGAGACCCCAGCCCC 
TCAGAACCAGACCAGCAGGGTAGTGCAGGCTCCCAGGGAGGAAGAGGAAGATGAGCAGGAGG 
CCAGCGAGGAGAAGGCCGGTGAGGAAGAGAAAGCCTGGCTGATGGCCAGCAGGCAGCAGCTT 
GCCAAGGAGACTTCAAACTTCGGATTCAGCCTGCTGCGAAAGATCTCCATGAGGCACGATGG 
CAACATGGTCTTCTCTCCATTTGGCATGTCCTTGGCCATGACAGGCTTGATGCTGGGGGCCA 
CAGGGCCGACTGAAACCCAGATCAAGAGAGGGCTCCACTTGCAGGCCCTGAAGCCCACCAAG 
CCCGGGCTCCTGCCTTCCCTCTTTAAGGGACTCAGAGAGACCCTCTCCCGCAACCTGGAACT 
GGGCCTCTCACAGGGGAGTTTTGCCTTCATCCACAAGGATTTTGATGTCAAAGAGACTTTCT 
TCAATTTATCCAAGAGGTATTTTGATACAGAGTGCGTGCCTATGAATTTTCGCAATGCCTCA 
CAGGCCAAAAGGCTCATGAATCATTACATTAACAAAGAGACTCGGGGGAAAATTCCCAAACT 
GTTTGATGAGATTAATCCTGAAACCAAATTAATTCTTGTGGATTACATCTTGTTCAAAGGGA 
AATGGTTGACCCCATTTGACCCTGTGTTCACCGAAGTCGACACTTTCCACCTGGACAAGTAC 
AAGACCATTAAGGTGCCCATGATGTACGGTGCAGGCAAGTTTGCCTCCACCTTTGACAAGAA 
TTTTCGTTGTCATGTCCTCAAACTGCCCTACCAAGGAAATGCCACCATGCTGGTGGTCCTCA 
TGGAGAAAATGGGTGACCACCTCGCCCTTGAAGACTACCTGACCACAGACTTGGTGGAGACA 
TGGCTCAGAAACATGAAAACCAGAAACATGGAAGTTTTCTTTCCGAAGTTCAAGCTAGATCA 
GAAGTATGAGATGCATGAGCTGCTTAGGCAGATGGGAATCAGAAGAATCTTCTCACCCTTTG 
CTGACCTTAGTGAACTCTCAGCTACTGGAAGAAATCTCCAAGTATCCAGGGTTTTACGAAGA 
ACAGTGATTGAAGTTGATGAAAGGGGCACTGAGGCAGTGGCAGGAATCTTGTCAGAAATTAC 
TGCTTATTCCATGCCTCCTGTCATCAAAGTGGACCGGCCATTTCATTTCATGATCTATGAAG 
AAACCTCTGGAATGCTTCTGTTTCTGGGCAGGGTGGTGAATCCGACTCTCCTATAATTCAGG 
ACATGCATAAGCACTTCGTGCTGTAGTAGATGCTGAATCTGAGGTATCAAACACACACAGGA 
TACCAGCAATGGATGGCAGGGGAGAGTGTTCCTTTTGTTCTTAACTAGTTTAGGGTGTTCTC 
AAATAAATACAGTAGTCCCCACTTATCTGAGGGGGATACATTCAAAGACCCCCAGCAGATGC 
CTGAAACGGTGGACAGTGCTGAACCTTATATATATTTTTTCCTACACATACATACCTATGAT 
AAAGTTTAATTTATAAATTAGGCACAGTAAGAGATTAACAATAATAACAACATTAAGTAAAA 
TGAGTTACTTGAACGCAAGCACTGCAATACCATAACAGTCAAACTGATTATAGAGAAGGCTA 
CTAAGTGACTCATGGGCGAGGAGCATAGACAGTGTGGAGACATTGGGCAAGGGGAGAATTCA 
CATCCTGGGTGGGACAGAGCAGGACGATGCAAGATTCCATCCCACTACTCAGAATGGCATGC 
TGCTTAAGACTTTTAGATTGTTTATTTCTGGAATTTTTCATTTAATGTTTTTGGACCATGGT 
TGACCATGGTTAACTGAGACTGCAGAAAGCAAAACCATGGATAAGGGAGGACTACTACAAAA 
GCATTAAATTGATACATATTTTTTAAAAAAAAAAAAAAAAAAA 
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FIGURE 292 

MKWPSLLLSVLLAQVWLVPGLAPSPQSPETPAPQNQTSRVVQAPREEEEDEQEASEEKAGE 
EEKAWLMASRQQIJtfCETSNFGFSLLRKISMRHDGNW 

KRGLHLQALKPTKPGLLPSLFKGLRETLSRNLELGLSQGSFAFIHKDFDVKETFFNLSKRYF 
DTECVPMNFRNASQAKRLMNHYINKETRGKIPKLFDEINPETKLILVDYILFKGKWLTPFDP 
VFTEVDTFHLDKYKTIKVPMMYGAGKFASTFDKNFRCHVLKLPYQGNATMLVVLMEKMGDHL 
ALEDYLTTDLVETWLRNMKTRNMEVFFPKFKLDQKYEMHELLRQMGIRRIFSPFADLSELSA 
TGRNLQVSRVLRRTVIEVDERGTEAVAG ILSE ITAYSMPPVI KVDRPFHFMI YEETSGMLLF 
LGRWNPTLL 
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FIGURE 293 



CTGGGATCAGCCACTGCAGCTCCCTGAGCACTCTCTACAGAGACGCGGACCCCAGACATCAG 
GAGGCTCCTCCTGGTCACCAGCCTGGTGGTTGTGCTGCTGTGGGAGGCAGGTGCAGTCCCAG 
CACCCAAGKSTCCCTATCAAGATGGAAGTCA^ 

GCCTGGGGCGCCCGTGTGGTGGAGCCTCCGGAGAAGGACGACCAGCTGGTGGTGCTGTTCCC 
TGTCCAGAAGCCGAAACTCTTGACCACCGAGGAGAAGCCACGAGGTCAGGGCAGGGGCCCCA 
TCCTTCCAGGCACCAAGGCCTGGATGGAGACCGAGGACACCCTGGGCCGTGTCCTGAGTCCC 
GAGCCCGACCATGACAGCCTGTACCACCCTCCGCCTGAGGAGGACCAGGGCGAGGAGAGGCC 
CCGGTTGTGGGTGATGCCAAATCACCAGGTGCTCCTGGGACCGGAGGAAGACCAAGACCACA 
TCTACCACCCCCAGTAGGGCTCCAGGGGCCATCACTGCCCCCGCCCTGTCCCAAGGCCCAGG 
CTGTTGGGACTGGGACCCTCCCTACCCTGCCCCAGCTAGACAAATAAACCCCAGCAGGCAAA 
AAAAAAAAAAAAAAAA 
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FIGURE 294 

MRRLLLVTSLWVLLWEAGAVPAPKVPIKMQVKHWPSEQDPEKAWGARWEPPEKDDQLWL 
FPVQKPKLLTTEEKPRGQGRGPILPGTKAWMETEDTLGRVLSPEPDHDSLYHPPPEEDQGEE 
RPRLWVMPNHQVLLGPEEDQDHIYHPQ 
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FIGURE 295 

AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTG 
TACCCAAGGAAAGTGCAGCTGAGACTCAGACAAGATTACAATGAACCAACTCAGCTTCCTGC 
TGTTTCTCATAGCGACCACCAGAGGATGGAGTACAGATGAGGCTAATACTTACTTCAAGGAA 
TGGACCTGTTCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCC 
TAGTGCATTTGATGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCT 
GTGACATGACCTCTGGGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATG 
CGTGGGAAGTGCACGGTGGGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCC 
AGAGGGGGACGGCAACTGGGCCAACTACAACACCTTTGGATCTGCAGAGGCGGCCACGAGCG 
ATGACTACAAGAACCCTGGCTACTACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTG 
CCC^yVTAAGTCCCCCATGC^GCACTGGAGAAACAGCTCCCTGCTGAGGTACCGCACGGACAC 
TGGCTTCCTCCAGACACTGGGACATAATCTGTTTGGCATCTACCAGAAATATCCAGTGAAAT 
ATGGAGAAGGAAAGTGTTGGACTGACAACGGCCCGGTGATCCCTGTGGTCTATGATTTTGGC 
GACGCCCAGAAAACAGCATCTTATTACTCACCCTATGGCCAGCGGGAATTCACTGCGGGATT 
TGTTCAGTTCAGGGTATTTAATAACGAGAGAGCAGCCAACGCCTTGTGTGCTGGAATGAGGG 
TCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAGGATACTTTCCAGAGGCCAGT 
CCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATATGGAACTCATGTTGGTTA 
CAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCGTTGAGAGTTTTGTG 
GGAGGGAACCCAGACCTCTCCTCCCAACCATGAGATCCCAAGGATGGAGAACAACTTACCCA 
GTAGCTAGAATGTTAATGGCAGAAGAGAAAACAATAAATCATATTGACTCAAGAAAAAAA 
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FIGURE 29fi 

MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTEN 
GVIYQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFG 
SAEAATSDDYKNPG YYD I QAKDLG I WHVPNKS PMQHWRNSSLLRYRTDTGFLQTLGHNLFG I 
YQKYPVKYGEGKCWTDNGPVIPWYDFGDAQKTASYYSPYGQREFTAGFVQFRVFNNERAAN 
ALCAGMRVTGCNTEHHCIGGGGYFPEASPQQCGDFSGFDW 
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FIGURE 297 

GCGGAGCCGGCGCCGGCTGCGCAGAGGAGCCGCTCTCGCCGCCGCCACCTCGGCTGGGAGCC 
CACGAGGCTGCCGCATCCTGCCCTCGGAACAATGGGACTCGGCGCGCGAGGTGCTTGGGCCG 
CGCTGCTCCTGGGGACGCTGCAGGTGCTAGCGCTGCTGGGGGCCGCCCATGAAAGCGCAGCC 
ATGGCGGCATCTGCAAACATAGAGAATTCTGGGCTTCCACACAACTCCAGTGCTAACTCAAC 
AGAGACTCTCCAACATGTGCCTTCTGACGATACA^TGAAACTTCCAACAGTACTGTGAAAC 
CACCAACTTCAGTTGCCTCAGACTCCAGTAATACAACGGTCACCACCATGAAACCTACAGCG 
GCATCTAATACAACAACACCAGGGATGGTCTCAACAAATATGACTTCTACCACCTTAAAGTC 
TACACCCAAAACAACAAGTGTTTCACAGAACACATCTCAGATATCAACATCCACAATGACCG 
TAACCCACAATAGTTCAGTGACATCTGCTGCTTCATCAGTAACAATCACAACAACTATGCAT 
TCTGT^AGCAT^AGAAAGGATCAAAATTTGATACTGGGAGCTTTGTTGGTGGTATTGTATTAAC 
GCTGGGAGTTTTATCTATTCTTTACATTGGATGCAAAATGTATTACTCAAGAAGAGGCATTC 
GGTATCGAACCATAGATGAACATGATGCCATCATTTAAGGAAATCCATGGACCAAGGATGGA 
ATACAGATTGATGCTGCCCTATCAATTAATTTTGGTTTATTAATAGTTTAAAACAATATTCT 
CTTTTTGAAAATAGTATAAACAGGCCATGCATATAATGTACAGTGTATTACGTAAATATGTA 
AAGATTCTTCAAGGTAACAAGGGTTTGGGTTTTGAAATAAACATCTGGATCTTATAGACCGT 
TCATACAATGGTTTTAGCAAGTTCATAGTAAGACAAACAAGTCCTATCTTTTTTTTTTGGCT 
GGGGTGGGGGCATTGGTCACATATGACCAGTAATTGAAAGACGTCATCACTGAAAGACAGAA 
TGCCATCTGGGCATACAAATAAGAAGTTTGTCACAGCACTCAGGATTTTGGGTATCTTTTGT 
AGCTCACATAAAGAACTTCAGTGCTTTTCAGAGCTGGATATATCTTAATTACTAATGCCACA 
CAGAAATTATACAATCAAACTAGATCTGAAGCATAATTTAAGAAAAACATCAACATTTTTTG 
TGCTTTAAACTGTAGTAGTTGGTCTAGAAACAAAATACTCC 
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FIGURE 298 

MGLGARGAWAALLLGTLQVLALLGAAHESAAMAASANIENSGLPHNSSANSTETLQHVPSDH 
TNETSNSTVKPPTSVASDSSNTTVTTMKPTAASNTTTPGMVSTNMTSTTLKSTPKTTSVSQN 
TSQISTSTMTVTHNSSVTSAASSVTITTTMHSEAKKGSKFDTGSFVGGIVLTLGVLSILYIG 
CKMYYSRRG I RYRT I DEHDA I I 



WO 99/63088 ^ I PCT/US99/12252 

FIGURE 299 

CAGCCGGGTCCCAAGCCTGTGCCTGAGCCTGAGCCTGAGCCTGAGCCCGAGCCGGGAGCCGG 
TCGCGGGGGCTCCGGGCTGTGGGACCGCTGGGCCCCCAGCGATGGCGACCCTGTGGGGAGGC 
CTTCTTCGGCTTGGCTCCTTGCTCAGCCTGTCGTGCCTGGCGCTTTCCGTGCTGCTGCTGGC 
GCAGCTGTCAGACGCCGCCAAGAATTTCGAGGATGTCAGATGTAAATGTATCTGCCCTCCCT 
ATAAAGAAAATTCTGGGCATATTTATAATAAGAACATATCTCAGAAAGATTGTGATTGCCTT 
CATGTTGTGGAGCCCATGCCTGTGCGGGGGCCTGATGTAGAAGCATACTGTCTACGCTGTGA 
ATGCAAATATGAAGAAAGAAGCTCTGTCACAATCAAGGTTACCATTATAATTTATCTCTCCA 
TTTTGGGCCTTCTACTTCTGTACATGGTATATCTTACTCTGGTTGAGCCCATACTGAAGAGG 
CGCCTCTTTGGACATGCACAGTTGATACAGAGTGATGATGATATTGGGGATCACCAGCCTTT 
TGCAAATGCACACGATGTGCTAGCCCGCTCCCGCAGTCGAGCCAACGTGCTGAACAAGGTAG 
AATATGCACAGCAGCGCTGGAAGCTTCAAGTCCAAGAGCAGCGAAAGTCTGTCTTTGACCGG 
CATGTTGTCCTCAGCTAATTGGGAATTGAATTCAAGGTGACTAGAAAGAAACAGGCAGACAA 
CTGGAAAGAACTGACTGGGTTTTGCTGGGTTTCATTTTAATACCTTGTTGATTTCACCAACT 
GTTGCTGGAAGATTCAAAACTGGAAGCAAAAACTTGCTTGATTTTTTTTTCTTGTTAACGTA 
ATAATAGAGACATTTTTAAAAGCACACAGCTCAAAGTCAGCCAATAAGTCTTTTCCTATTTG 
TGACTTTTACTAATAAAAATAAATCTGCCTGTAAATTATCTTGAAGTCCTTTACCTGGAACA 
AGCACTCTCTTTTTCACCACATAGTTTTAACTTGACTTTCAAGATAATTTTCAGGGTTTTTG 
TTGTTGTTGTTTTTTGTTTGTTTGTTTTGGTGGGAGAGGGGAGGGATGCCTGGGAAGTGGTT 
AACAACTTTTTTCAAGTCACTTTACTAAACAAACTTTTGTAAATAGACCTTACCTTCTATTT 
TCGAGTTTCATTTATATTTTGCAGTGTAGCCAGCCTCATCAAAGAGCTGACTTACTCATTTG 
ACTTTTGCACTGACTGTATTATCTGGGTATCTGCTGTGTCTGCACTTCATGGTAAACGGGAT 
CTAAAATGCCTGGTGGCTTTTCACAAAAAGCAGATTTTCTTCATGTACTGTGATGTCTGATG 
CAATGCATCCTAGAACAAACTGGCCATTTGCTAGTTTACTCTAAAGACTAAACATAGTCTTG 
GTGTGTGTGGTCTTACTCATCTTCTAGTACCTTTAAGGACAAATCCTAAGGACTTGGACACT 
TGCAATAAAGAAATTTTATTTTAAACCCAAGCCTCCCTGGATTGATAATATATACACATTTG 
TCAGCATTTCCGGTCGTGGTGAGAGGCAGCTGTTTGAGCTCCAATATGTGCAGCTTTGAACT 
AGGGCTGGGGTTGTGGGTGCCTCTTCTGAAAGGTCTAACCATTATTGGATAACTGGCTTTTT 
TCTTCCTATGTCCTCTTTGGAATGTAACAATAAAAATAATTTTTGAAACATCAA 
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FIGURE 300 

MATLWGGLLRLGSLLSLSCLALSVLLLAQLSDAAKNFEDVRCKCICPPYKENSGHIYNKNIS 
QKDCDCLHWEPMPVRGPDVEAYCLRCECKYEERSSVTIKVTI 1 1 YLS I LGLLLLYMVYLTL 
VEPILKRRLFGHAQLIQSDDDIGDHQPFANAHDVLARSRSRANVLNKVEYAQQRWKLQVQEQ 
RKSVFDRHWLS 
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FIGURE 301 
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GCACCTGCGACCACCGTGAGCAGTCATGGCGTACTCCACAGTGCAGAGAGTCGCTCTGGCTT 
CTGGGCTTGTCCTGGCTCTGTCGCTGCTGCTGCCCAAGGCCTTCCTGTCCCGCGGGAAGCGG 
CAGGAGCCGCCGCCGACACCTGAAGGAAAATTGGGCCGATTTCCACCTATGATGCATCATCA 
CCAGGCACCCTCAGATGGCCAGACTCCTGGGGCTCGTTTCCAGAGGTCTCACCTTGCCGAGG 
CATTTGCAAAGGCCAAAGGATCAGGTGGAGGTGCTGGAGGAGGAGGTAGTGGAAGAGGTCTG 
ATGGGGCAGATTATTCCAATCTACGGTTTTGGGATTTTTTTATATATACTGTACATTCTATT 
TAAGGTAAGTAGAATCATCCTAATCATATTACATCAATGAAAATCTAATATGGCGATAAAAA 
TCATTGTCTACATTAAAACTTCTTATAGTTCATAAAATTATTTCAAATCCATCATCTCTTTA 
AATCCTGCCTCCTCTTCATGAGGTACTTAGGATAGCCATTATTTCAGTTTCACATAAGAATG 
TTTACTCAATGTTTAAGTGTTTTGCCCCAAAATTCACAACTAACAAGGCAGAACTAGGACTT 
GAACATGGATCTTTTGGTTCTTAATCCAGTGAGTGATACAATTCAATGCACTCCCCTGCCA 
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FIGURE 302 

MAYSTVQRVALASGLVIjALSLLLPKAFLSRGKRQEPPPTPEGKLGRFPPMMHHHQAPSDGQT 
PGARFQRSHLAEAFAKAKGSGGGAGGGGSGRGLMGQIIPIYGFGIFLYILYILFKVSRIILI 
ILHQ 
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FIGURE 303 
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CGGCTCGAGTGCAGCTGTGGGGAGATTTCAGTGCATTGCCTCCCCTGGGTGCTCTTCATCTT 
GGATTTGAAAGTTGAGAGCAGCATGTTTTGCCCACTGAAACTCATCCTGCTGCCAGTGTTAC 
TGGATTATTCCTTGGGCCTGAATGACTTGAATGTTTCCCCGCCTGAGCTAACAGTCCATGTG 
GGTGATTCAGCTCTGATGGGATGTGTTTTCCAGAGCACAGAAGACAAATGTATATTCAAGAT 
AGACTGGACTCTGTCACCAGGAGAGCACGCCAAGGACGAATATGTGCTATACTATTACTCCA 
ATCTCAGTGTGCCTATTGGGCGCTTCCAGAACCGCGTACACTTGATGGGGGACATCTTATGC 
AATGATGGCTCTCTCCTGCTCCAAGATGTGCAAGAGGCTGACCAGGGAACCTATATCTGTGA 
AATCCGCCTCAAAGGGGAGAGCCAGGTGTTCAAGAAGGCGGTGGTACTGCATGTGCTTCCAG 
AGGAGCCCAAAGAGCTCATGGTCCATGTGGGTGGATTGATTCAGATGGGATGTGTTTTCCAG 
AGCACAGAAGTGAAACACGTGACCAAGGTAGAATGGATATTTTCAGGACGGCGCGCAAAGGA 
GGAGATTGTATTTCGTTACTACCACAAACTCAGGATGTCTGTGGAGTACTCCCAGAGCTGGG 
GCCACTTCCAGAATCGTGTGAACCTGGTGGGGGACATTTTCCGCAATGACGGTTCCATCATG 
CTTCAAGGAGTGAGGGAGTCAGATGGAGGAAACTACACCTGCAGTATCCACCTAGGGAACCT 
GGTGTTCAAGAAAACCATTGTGCTGCATGTCAGCCCGGAAGAGCCTCGAACACTGGTGACCC 
CGGCAGCCCTGAGGCCTCTGGTCTTGGGTGGTAATCAGTTGGTGATCATTGTGGGAATTGTC 
TGTGCCACAATCCTGCTGCTCCCTGTTCTGATATTGATCGTGAAGAAGACCTGTGGAAATAA 
GAGTTCAGTGAATTCTACAGTCTTGGTGAAGAACACGAAGAAGACTAATCCAGAGATAAT^AG 
AAAAACCCTGCCATTTTGAAAGATGTGAAGGGGAGAAACACATTTACTCCCCAATAATTGTA 
CGGGAGGTGATCGAGG AAGAAGAAC CAAGTG AAAAAT C AGAGG C C AC CT AC ATGAC CATGC A 
CCCAGTTTGGCCTTCTCTGAGGTCAGATCGGAACAACTCACTTGAAAAAAAGTCAGGTGGGG 
GAATGCCAAAAACACAGCAAGCCTTTTGAGAAGAATGGAGAGTCCCTTCATCTCAGCAGCGG 
TGGAGACTCTCTCCTGTGTGTGTCCTGGGCCACTCTACCAGTGATTTCAGACTCCCGCTCTC 
CCAGCTGTCCTCCTGTCTCATTGTTTGGTCAATACACTGAAGATGGAGAATTTGGAGCCTGG 
CAGAGAGACTGGACAGCTCTGGAGGAACAGGCCTGCTGAGGGGAGGGGAGCATGGACTTGGC 
CTCTGGAGTGGGACACTGGCCCTGGGAACCAGGCTGAGCTGAGTGGCCTCAAACCCCCCGTT 
GGATCAGACCCTCCTGTGGGCAGGGTTCTTAGTGGATGAGTTACTGGGAAGAATCAGAGATA 
AAAACCAACCCAAATCAA 
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FIGURE 304 

MFCPLKLILLPVLLDYSLGLND 

EHAKDEYVLYYYSNLSVPIGRFQNRVHLMGDILCNDGSLLLQDVQEADQGTYICEIRLKGES 
QVFKKAWLH VL PEE P KELMVHVGGL I QMGCVFQS TE VKHVTKVE W I FS GRRAKE E I VFRY Y 
HKLRMSVEYSQSWGHFQNRWLVGDIFRNDGSIMLQGVRESDGGNYTCSIHLGNLVFKKTIV 
LHVSPEEPRTLVTPAALRPLVLGGNQLVIIVGIVCATILLLPVLILIVKKTCGNKSSVNSTV 
LVKNTKKTNPEIKEKPCHFERCEGEKHIYSPIIVREVIEEEEPSEKSEATYMTMHPVWPSLR 
SDRNNSLEKKSGGGMPKTQQAF 
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FIGURE 305 

CTATGAAGAAGCTTCCTGGAAAACAATAAGC^^^ 

GTTCTACCCTACTAAAGACAGGAAGATCATAAACTGACAGATACTGAAATTGTAAGAGTTGG 
AAACTACATTTTGCAAAGTCATTGAACTCTGAGCTCAGTTGCAGTACTCGGGAAGCCA^CA 
GGATGAAGATGGATACATCACCTTAAATATTAAAACTCGGAAACCAGCTCTCGTCTCCGTTG 
GCCCTGCATCCTCCTCCTGGTGGCGTGTGATGGCTTTGATTCTGCTGATCCTGTGCGTGGGG 
ATGGTTGTCGGGCTGGTGGCTCTGGGGATTTGGTCTGTCATGCAGCGCAATTACCTACAAGA 
TGAGAATGAAAATCGCACAGGAACTCTGCAACAATTAGCAAAGCGCTTCTGTCAATATGTGG 
TAAAACAATCAGAACTAAAGGGCACTTTCAAAGGTCATAAATGCAGCCCCTGTGACACAAAC 
TGGAGATATTATGGAGATAGCTGCTATGGGTTCTTCAGGCACAACTTAACATGGGAAGAGAG 
TAAGCAGTACTGCACTGACATGAATGCTACTCTCCTGAAGATTGACAACCGGAACATTGTGG 
AGTACATCAAAGCCAGGACTCATTTAATTCGTTGGGTCGGATTATCTCGCCAGAAGTCGAAT 
GAGGTCTGGAAGTGGGAGGATGGCTCGGTTATCTCAGAAAATATGTTTGAGTTTTTGGAAGA 
TGGAAAAGGAAATATGAATTGTGCTTATTTTCATAATGGGAAAATGCACCCTACCTTCTGTG 
AGAACAAACATTATTTAATGTGTGAGAGGAAGGCTGGCATGACCAAGGTGGACCAACTACCT 
TAATGCAAAGAGGTGGACAGGATAACACAGATAAGGGCTTTATTGTACAATAAAAGATATGT 
ATGAATGCATCAGTAGCTGAAAAAAAAAAAAAA 
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FIGURE 306 

MQDEDG YI TLN I KTRKPALVS VGPASSS WWRVMAL I LL I LCVGMWGL VALG I WS VMQRNYL 
QDENENRTGTLQQLAKRFCQYVVKQSELKGTFKGHKCSPCDTNWRYYGDSCYGFFRHNLTWE 
ESKQYCTDMNATLLKIDNRNIVEYIKARTHLIRWGLSRQKSNEVWKWEDGSVISENMFEFL 
EDGKGNMNCAYFHNGKMHPTFCENKHYLMCERKAGMTKVDQLP 
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